Abstract

Let Λ be a finite measure on the unit interval. A Λ-Fleming-Viot process is a probability
measure valued Markov process which is dual to a coalescent with multiple collisions
(Λ-coalescent), in analogy to the duality known for the classical Fleming-Viot process and
Kingman's coalescent, where Λ is the Dirac measure in 0.

We explicitly construct a dual process of the coalescent with simultaneous multiple collisions
(Ξ-coalescent) with mutation, the Ξ-Fleming-Viot process with mutation, and provide
a representation based on the empirical measure of an exchangeable particle system along the
lines of Donnelly and Kurtz (1999). We establish pathwise convergence of the approximating
systems to the limiting Ξ-Fleming-Viot process with mutation. An alternative construction
of the semigroup based on the Hille-Yosida theorem is provided and various types of duality
of the processes are discussed.

In the last part of the paper a population is considered which undergoes recurrent bottlenecks.
In this scenario, non-trivial Ξ-Fleming-Viot processes naturally arise as limiting models.

1 Introduction and main results

1.1 Motivation

One of the fundamental aims of mathematical population genetics is the construction of
population models in order to describe and to analyse certain phenomena which are of interest
for biological applications. Usually these models are constructed such that they describe the
evolution of the population under consideration forwards in time. A classical and widely used
model of this kind is the Wright-Fisher diffusion, which can be used for large populations to
approximate the evolution of the fraction of individuals carrying a particular allele. On the
other hand it is often quite helpful to look from the present back into the past and to trace back
the ancestry of a sample of n individuals, genes or particles. In many situations, the Kingman
coalescent [K82a, K82b] turns out to be an appropriate tool to approximate the ancestry of
a sample taken from a large population. It is well known that the Wright-Fisher diffusion is
dual to the block counting process of the Kingman coalescent [D86, MOT]. More generally, the
Fleming-Viot process [FV79], a measure-valued extension of the Wright-Fisher diffusion, is dual
to the Kingman coalescent. Such and similar duality results are quite common, in particular in
the physics literature on interacting particle systems [L85] and in the more theoretical literature
on mathematical population genetics [AHOT, MoK, DK96, DK99, EK95, Hnn, MM, MM].
Donnelly and Kurtz [DK96] established a so-called lookdown construction and used this construction
to show that the Fleming-Viot process is dual to the Kingman coalescent. This construction
and corresponding duality results have been extended [DK99, BLG03, BLG05, BLG06] to the
Λ-Fleming-Viot process, which is the measure-valued dual of a coalescent process allowing for
multiple collisions of ancestral lineages. For more information on coalescent processes with
multiple collisions, so-called Λ-coalescents, we refer to Pitman [P99] and Sagitov [S99].

There exists a broader class of coalescent processes [MS01, S00, S03] in which several multiple
collisions can occur simultaneously with positive probability. These processes
can be characterized by a measure Ξ on an infinite simplex and are hence called Ξ-coalescents.
It is natural to further extend the above constructions and results to this full class of coalescent
processes and, in particular, to provide constructions of the dual processes, called
Ξ-Fleming-Viot processes. Although such extensions have been briefly indicated in [DK99] and [BLG03],
they have not been carried out in detail yet. Ξ-coalescents have also recently been
applied to study population genetic problems, see [TV08, SW08].

The motivation to present this paper is hence manifold. We explicitly construct the
Ξ-Fleming-Viot process and provide a representation via empirical measures of an exchangeable
particle system in the spirit of Donnelly and Kurtz [DK96, DK99]. We furthermore establish
corresponding convergence results and pathwise duality to the Ξ-coalescent. We also provide an
alternative, more classical functional-analytic construction of the Ξ-Fleming-Viot process based
on the Hille-Yosida theorem and present representations for the generator of the Ξ-Fleming-Viot
process. Our approaches include neutral mutations. The results give insights into the pathwise
structure of the Ξ-Fleming-Viot process and its dual Ξ-coalescent. Examples and situations
are presented in which certain Ξ-Fleming-Viot processes and their dual Ξ-coalescents occur
naturally.

1.2 Moran models with (occasionally) large families 

Consider a population of fixed size $N \in \mathbb{N} := \{1, 2, \dots\}$ and assume that each individual is
of a certain type, where the space E of possible types is assumed to be compact and Polish.
Furthermore assume that for each vector $k = (k_1, k_2, \dots)$ of integers satisfying
$k_1 \ge k_2 \ge \cdots \ge 0$ and $\sum_i k_i \le N$ a non-negative real quantity $r_N(k) \ge 0$ is given.
The population is assumed to evolve in continuous time as follows. Given a vector
$k = (k_1, \dots, k_m, 0, 0, \dots)$, where $k_1 \ge \cdots \ge k_m \ge 1$ and $k_1 + \cdots + k_m \le N$, with rate
$r_N(k)$ we choose randomly m groups of sizes $k_1, \dots, k_m$ from the present population. Inside
each of these m groups we furthermore choose randomly a 'parent' which forces all individuals
in its group to change their type to the type of that parent. We say that a k-reproduction event
occurs with rate $r_N(k)$. The classical Moran model corresponds to $r_N(2, 0, 0, \dots) = N$.
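A single k-reproduction event as described above can be sketched as follows. This is only an illustration under stated assumptions: the function name `k_reproduction_event`, the list encoding of the population and the use of Python's `random.Random` are choices made here and do not appear in the paper; the timing of events (the rates $r_N(k)$) is not simulated.

```python
import random


def k_reproduction_event(types, ks, rng):
    """Apply one k-reproduction event to a list of types (hypothetical helper).

    types: list of the current types of the N individuals
    ks:    group sizes (k_1, k_2, ...); zero entries are ignored
    rng:   a random.Random instance
    """
    sizes = [k for k in ks if k > 0]
    if sum(sizes) > len(types):
        raise ValueError("group sizes must sum to at most N")
    # A uniform sample without replacement, cut into consecutive chunks,
    # yields uniformly chosen disjoint groups of the requested sizes.
    chosen = rng.sample(range(len(types)), sum(sizes))
    new_types = list(types)
    start = 0
    for k in sizes:
        group = chosen[start:start + k]
        parent = rng.choice(group)          # uniformly chosen parent
        for individual in group:
            new_types[individual] = types[parent]
        start += k
    return new_types
```

If all individuals initially carry distinct types, an event with group sizes (3, 2) leaves exactly N - 3 distinct types, since each group collapses onto the type of its parent.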

Except for the fact that these models are formulated in continuous time, they essentially
coincide with the class of neutral exchangeable population models with non-overlapping generations
introduced by Cannings [C74, C75]. Starting with the seminal work of Kingman [K82a, K82b],
the genealogy of samples taken from such populations is well understood, in particular for the
situation when the total population size N tends to infinity.

1.3 Genealogies and exchangeable coalescents 

For neutral population models of large, but fixed population size and finite-variance reproduction
mechanism, Kingman [K82a] showed that the genealogy of a finite sample of size n can
be approximately described by the so-called n-coalescent $(\Pi^{\delta_0,(n)}_t)_{t \ge 0}$. The n-coalescent is a
time-homogeneous Markov process taking values in $\mathcal{P}_n$, the set of partitions of $\{1, \dots, n\}$. If i
and j are in the same block of the partition $\Pi^{\delta_0,(n)}_t$, then they have a common ancestor at time
t ago. $\Pi^{\delta_0,(n)}_0$ is the partition of $\{1, \dots, n\}$ into singleton blocks. The transitions are then given
as follows: If there are b blocks at present, then each pair of blocks merges with rate 1, thus
the overall rate of seeing a merging event is $\binom{b}{2}$. Note that only binary mergers are allowed and
that at some random time, all individuals will have a (most recent) common ancestor.
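The transition mechanism of the n-coalescent can be sketched in a few lines. The following is a minimal simulation for illustration only; the function name `kingman_n_coalescent` and the representation of partitions as lists of Python sets are assumptions made here, not notation from the paper.

```python
import random


def kingman_n_coalescent(n, rng):
    """Simulate one path of the n-coalescent (illustrative sketch).

    Returns a list of (time, partition) pairs, starting from singletons.
    """
    blocks = [{i} for i in range(1, n + 1)]
    t = 0.0
    history = [(t, [set(b) for b in blocks])]
    while len(blocks) > 1:
        b = len(blocks)
        # Each of the b*(b-1)/2 pairs merges at rate 1, so the waiting
        # time until the next merger is exponential with that total rate.
        t += rng.expovariate(b * (b - 1) / 2)
        i, j = rng.sample(range(b), 2)      # uniformly chosen pair of blocks
        merged = blocks[i] | blocks[j]
        blocks = [blk for k, blk in enumerate(blocks) if k not in (i, j)]
        blocks.append(merged)
        history.append((t, [set(blk) for blk in blocks]))
    return history
```

Since only binary mergers occur, a sample of size n always reaches its most recent common ancestor after exactly n - 1 merging events.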

Kingman [K82a] also showed that there exists a $\mathcal{P}_{\mathbb{N}}$-valued Markov process $(\Pi^{\delta_0}_t)_{t \ge 0}$, where
$\mathcal{P}_{\mathbb{N}}$ denotes the set of partitions of $\mathbb{N}$. This process, the so-called Kingman coalescent, is
characterised by the fact that for each n the restriction of $(\Pi^{\delta_0}_t)_{t \ge 0}$ to the first n natural numbers is
the n-coalescent. The process can be constructed by an application of the standard Kolmogoroff
extension theorem, since the restriction of every n-coalescent to $\{1, \dots, m\}$, where $1 \le m \le n$,
is an m-coalescent.

Whereas the Kingman coalescent allows only for binary mergers, the idea of a time-homogeneous
$\mathcal{P}_{\mathbb{N}}$-valued Markov process that evolves by the coalescence of blocks was extended by
Pitman [P99] and Sagitov [S99] to coalescents where multiple blocks are allowed to merge at the
same time, so-called Λ-coalescents, which arise as the limiting genealogy of populations where
the variance of the offspring distribution diverges as the population size tends to infinity. Möhle
and Sagitov [MS01] and Schweinsberg [S00] introduced the even larger class of coalescents with
simultaneous multiple collisions, also called exchangeable coalescents or Ξ-coalescents, which
describe the genealogies of populations allowing for large family sizes.

Schweinsberg [S00] showed that any exchangeable coalescent $(\Pi_t)_{t \ge 0}$ is characterised by a
finite measure Ξ on the infinite simplex

$$\Delta := \Big\{\zeta = (\zeta_1, \zeta_2, \dots) : \zeta_1 \ge \zeta_2 \ge \cdots \ge 0,\ \sum_{i=1}^{\infty} \zeta_i \le 1\Big\}.$$

Throughout the paper, for $\zeta \in \Delta$, the notation $|\zeta| := \sum_{i=1}^{\infty} \zeta_i$ and $(\zeta, \zeta) := \sum_{i=1}^{\infty} \zeta_i^2$ will be used
for convenience. Note that Möhle and Sagitov [MS01] provide an alternative (though somewhat
less intuitive) characterisation of the Ξ-coalescent based on a sequence of finite symmetric
measures $(F_r)_{r \in \mathbb{N}}$. Coalescent processes with multiple collisions (Λ-coalescents) occur if the
measure Ξ is concentrated on the subset of all points $\zeta \in \Delta$ satisfying $\zeta_i = 0$ for all $i \ge 2$. The
Kingman coalescent corresponds to the case $\Xi = \delta_0$. It is convenient to decompose the measure
Ξ into a 'Kingman part' and a 'simultaneous multiple collision part', that is, $\Xi = a\delta_0 + \Xi_0$ with
$a := \Xi(\{0\}) \in [0, \infty)$ and $\Xi_0(\{0\}) = 0$. The transition rates of the Ξ-coalescent $\Pi$ are given as
follows. Suppose there are currently b blocks. Exactly $\sum_{i=1}^{r} k_i$ blocks collide into r new blocks,
each containing $k_1, \dots, k_r \ge 2$ original blocks, and s single blocks remain unchanged, such that
the condition $\sum_{i=1}^{r} k_i + s = b$ holds. The order of $k_1, \dots, k_r$ does not matter. The rate at which
the above collision happens is then given as (Schweinsberg [S00, Theorem 2])

$$\lambda_{b;k_1,\dots,k_r;s} = a\,\mathbf{1}_{\{r=1,\,k_1=2\}} + \int_{\Delta} \sum_{l=0}^{s} \binom{s}{l} (1 - |\zeta|)^{s-l} \sum_{i_1 \ne \cdots \ne i_{r+l}} \zeta_{i_1}^{k_1} \cdots \zeta_{i_r}^{k_r}\, \zeta_{i_{r+1}} \cdots \zeta_{i_{r+l}}\, \frac{\Xi_0(d\zeta)}{(\zeta, \zeta)}. \qquad (1.1)$$

An intuitive explanation of (1.1) is given below in terms of Schweinsberg's [S00] Poisson process
construction of the Ξ-coalescent. If $\Xi(\Delta) \ne 0$, then without loss of generality it can be assumed
that Ξ is a probability measure, as remarked after Eq. (12) of [S00]. Otherwise simply divide
each rate by the total mass $\Xi(\Delta)$ of Ξ.
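For a concrete feeling for Schweinsberg's rate formula, the following sketch evaluates it in the special case where the non-Kingman part is a unit point mass at a vector ζ with finitely many non-zero entries, so that the integral over the simplex reduces to a single evaluation. It assumes the rate has the form $a\,\mathbf{1}_{\{r=1,k_1=2\}} + \int \sum_{l=0}^{s} \binom{s}{l}(1-|\zeta|)^{s-l} \sum_{i_1 \ne \cdots \ne i_{r+l}} \zeta_{i_1}^{k_1}\cdots\zeta_{i_r}^{k_r}\zeta_{i_{r+1}}\cdots\zeta_{i_{r+l}}\,\Xi_0(d\zeta)/(\zeta,\zeta)$; the function name `xi_rate_point_mass` and its parameters are hypothetical.

```python
from itertools import permutations
from math import comb, prod


def xi_rate_point_mass(zeta, ks, s, a=0.0):
    """Collision rate when Xi_0 is a unit point mass at zeta (sketch).

    zeta: finite list of the non-zero entries of the point in the simplex
    ks:   merger sizes (k_1, ..., k_r), each at least 2
    s:    number of blocks that stay single
    a:    mass of the Kingman part at 0
    """
    zeta = [z for z in zeta if z > 0]
    r = len(ks)
    mass = sum(zeta)                      # |zeta|
    norm = sum(z * z for z in zeta)       # (zeta, zeta)
    kingman = a if (r == 1 and ks[0] == 2) else 0.0
    acc = 0.0
    for l in range(s + 1):
        inner = 0.0
        # Ordered tuples of pairwise distinct indices i_1, ..., i_{r+l}.
        for idx in permutations(range(len(zeta)), r + l):
            inner += (prod(zeta[i] ** k for i, k in zip(idx[:r], ks))
                      * prod(zeta[i] for i in idx[r:]))
        acc += comb(s, l) * (1 - mass) ** (s - l) * inner
    return kingman + acc / norm
```

For ζ = (x, 0, 0, ...) and a single merger of k out of b blocks the expression collapses to $x^{k-2}(1-x)^{b-k}$, the familiar rate of a Λ-coalescent with Λ a point mass at x, which provides a simple sanity check.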

1.4 Poisson process construction of the Ξ-coalescent

It is convenient to give an explicit construction of the Ξ-coalescent in terms of Poisson processes.
Indeed, Schweinsberg [S00, Section 3] shows that the Ξ-coalescent can be constructed from a
family of Poisson processes $(\mathfrak{N}^K_{i,j})_{i,j \in \mathbb{N},\, i<j}$ and a Poisson point process $\mathfrak{M}^{\Xi_0}$ on $\mathbb{R}_+ \times \Delta \times [0,1]^{\mathbb{N}}$.
The processes $\mathfrak{N}^K_{i,j}$ have rate $a = \Xi(\{0\})$ each and govern the binary mergers of the coalescent.
The process $\mathfrak{M}^{\Xi_0}$ has intensity measure

$$dt \otimes \frac{\Xi_0(d\zeta)}{(\zeta, \zeta)} \otimes \big(\mathbf{1}_{[0,1]}(u)\,du\big)^{\otimes \mathbb{N}}. \qquad (1.2)$$

These processes can be used to construct the Ξ-coalescent as follows: Assume that before the
time $t_m$ the process $\Pi$ is in a state $\{B_1, B_2, \dots\}$. If $t_m$ is a point of increase of one of the processes
$\mathfrak{N}^K_{i,j}$ (and there are at least $i \vee j$ blocks), then we merge the corresponding blocks $B_i$ and $B_j$
into a single block (and renumber). This mechanism corresponds to the Kingman component
of the coalescent.

The non-Kingman collisions are governed by the points

$$(t_m, \zeta_m, u_m) = \big(t_m, (\zeta_{m1}, \zeta_{m2}, \dots), (u_{m1}, u_{m2}, \dots)\big) \qquad (1.3)$$

of the Poisson process $\mathfrak{M}^{\Xi_0}$. The random vector $\zeta_m$ denotes the respective asymptotic family
sizes in the multiple merger event at time $t_m$ and the $u_{ml}$ are "uniform coins", determining the
blocks participating in the respective merger groups; see (2.2) or [S00, Section 3] for details.



1.5 Ξ-Fleming-Viot processes

An approach to population genetics that is in many senses dual is to view a population of finite size
N as a vector of types $(Y_1^N, \dots, Y_N^N)$ with values in $E^N$, or as an empirical measure
$\frac{1}{N} \sum_{i=1}^{N} \delta_{Y_i^N}$ of that vector, and to look at its evolution under mutation and resampling
forwards in time. When N tends to infinity one obtains the Fleming-Viot process [FV79]. This process
has been extended to incorporate other important biological phenomena and has found wide
applications, see [EK93] for a survey.

Donnelly and Kurtz [DK96] embedded an $E^\infty$-valued particle system into the classical
Fleming-Viot process, via a clever lookdown construction, and showed that it is dual to the
Kingman coalescent. This construction and the duality have been extended to the so-called
Λ-Fleming-Viot process, dual to the Λ-coalescents, and investigated by several authors, see, e.g.,
[DK99, BBC05, BLG03, BLG05, BLG06], or [BB07] for an overview.

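Let $f \in C_b(E^p)$, $\mu \in \mathcal{M}_1(E)$ and $G_f(\mu) := \int_{E^p} f(x_1, \dots, x_p)\, \mu^{\otimes p}(dx_1, \dots, dx_p)$. The generator of the
Λ-Fleming-Viot process without mutation has the form (see [BBC05, Equation (1.11)])

$$L G_f(\mu) = \sum_{J \subset \{1,\dots,p\},\ |J| \ge 2} \lambda_{p,|J|} \int_{E^p} \big( f(x_1^J, \dots, x_p^J) - f(x_1, \dots, x_p) \big)\, \mu^{\otimes p}(dx_1, \dots, dx_p), \qquad (1.4)$$

where

$$x_i^J = \begin{cases} x_{\min(J)} & \text{if } i \in J, \\ x_i & \text{otherwise.} \end{cases} \qquad (1.5)$$

Note that (1.4) includes the generator of the classical Fleming-Viot process (without mutation)
if the summation is restricted to sets J satisfying $|J| = 2$.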

Our aim in this paper is to present the modified lookdown construction for a measure-valued
process that we will call the Ξ-Fleming-Viot process with mutation, or the (Ξ, B)-Fleming-Viot
process. The symbol B stands here for an operator describing the mutation process. We will
establish its duality to the Ξ-coalescent with mutation. The modified lookdown construction
will also enable us to establish some path properties of the (Ξ, B)-Fleming-Viot process.

1.6 A modified lookdown construction of the (Ξ, B)-Fleming-Viot process

Consider a population described by a vector $Y^N(t) = (Y_1^N(t), \dots, Y_N^N(t))$ with values in $E^N$,
where $Y_i^N(t)$ is the type of individual i at time t. The evolution of this population (forwards
in time) has two components, namely reproduction and mutation. During its lifetime, each
particle undergoes mutation according to the bounded linear mutation operator

$$Bf(x) = r \int_E \big(f(y) - f(x)\big)\, q(x, dy), \qquad (1.6)$$

where f is a bounded function on E, $q(x, dy)$ is a Feller transition function on $E \times \mathcal{B}(E)$, and
$r \ge 0$ is the global mutation rate.
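On a finite type space a bounded mutation operator of jump type can be written down explicitly. The sketch below assumes the usual form $Bf(x) = r \sum_y q(x, y)\,(f(y) - f(x))$, that is, jumps at rate r according to a transition matrix q; the function name `mutation_operator` and the list-of-lists encoding are illustrative assumptions and not taken from the paper.

```python
def mutation_operator(f, q, r):
    """Evaluate (Bf)(x) = r * sum_y q[x][y] * (f[y] - f[x]) for all x.

    f: list of function values on the type space {0, ..., n-1}
    q: row-stochastic transition matrix as a list of lists
    r: global mutation rate (non-negative)
    """
    n = len(f)
    return [r * sum(q[x][y] * (f[y] - f[x]) for y in range(n))
            for x in range(n)]


# Two types, mutation rate 2: type 0 mutates to type 1 with probability 1/2.
Bf = mutation_operator([0.0, 1.0], [[0.5, 0.5], [0.2, 0.8]], 2.0)
```

Constant functions are mapped to zero, as they must be for the generator of a conservative jump process.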

The resampling of the population is governed by the Poisson point process $\mathfrak{M}^{\Xi_0}$, which was
introduced as a driving process for the Ξ-coalescent. In particular, the resampling events allow
for the simultaneous occurrence of one or more large families. The resampling procedure is
described in detail in Section 2. An important fact is that this resampling is made such that it
retains exchangeability of the population vector.

In Section 2 we introduce another particle system $X^N = (X_1^N, \dots, X_N^N)$, again with values in
$E^N$. Each particle mutates according to the same generator (1.6) as before. For the resampling
event, we will use the same driving Poisson point process $\mathfrak{M}^{\Xi_0}$, but we will use the modified
lookdown construction of Donnelly and Kurtz introduced in [DK99], suitably adapted to our
scenario. This (Ξ, B)-lookdown process will be introduced in Section 2.2. It is crucial that the
resampling events retain exchangeability of the population vector and that the process $\{X^N(t)\}$
has the same empirical measure $\frac{1}{N} \sum_{i=1}^{N} \delta_{X_i^N(t)}$ as the process $\{Y^N(t)\}$.

The construction of the resampling events allows us to pass to the limit as N tends to infinity
and obtain an $E^\infty$-valued particle system $X = (X_1, X_2, \dots)$. Since this particle system is also
exchangeable, this procedure enables us to access the almost sure limit of the empirical measure
as N tends to infinity by the De Finetti Theorem (which is not possible for the $Y^N$).

1.7 Results 

Let $\mathcal{D}(B)$ denote the domain of the mutation generator B and let $f_1, f_2, \dots \in \mathcal{D}(B)$ be functions
that separate points of $\mathcal{M}_1(E)$ in the sense that $\int f_k\, d\mu = \int f_k\, d\nu$ for all $k \in \mathbb{N}$ implies that
$\mu = \nu$. Such sequences exist, see, e.g., Section 1 (Lemma 1.1 in particular) of [DK96]. We use
the metric d on $\mathcal{M}_1(E)$ defined via

$$d(\mu, \nu) := \sum_{k} 2^{-k} \Big| \int f_k\, d\mu - \int f_k\, d\nu \Big| \qquad (1.7)$$

and equip the topology of locally uniform convergence on $D_{\mathcal{M}_1(E)}([0, \infty))$ with the metric

$$d_p(\mu, \nu) := \int_0^\infty e^{-t}\, d(\mu(t), \nu(t))\, dt. \qquad (1.8)$$

Theorem 1.1. The $\mathcal{M}_1(E)$-valued process $(Z_t)_{t \ge 0}$, defined in terms of the ordered particle
system $X = (X_1, X_2, \dots)$ by

$$Z_t := \lim_{n \to \infty} Z_t^n = \lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^{n} \delta_{X_i(t)}, \qquad t \ge 0,$$

is called the Ξ-Fleming-Viot process with mutation operator B or simply the (Ξ, B)-Fleming-Viot
process. Moreover, the empirical processes $(Z_t^n)_{t \ge 0}$ converge almost surely on the path
space $D_{\mathcal{M}_1(E)}([0, \infty))$ to the càdlàg process $(Z_t)_{t \ge 0}$.

Since the empirical measures of $X^n$ and $Y^n$ are identical, we arrive at the following corollary.

Corollary 1.2. Define, for each n,

$$Z_t^n := \frac{1}{n} \sum_{i=1}^{n} \delta_{Y_i^n(t)},$$

the empirical process of the n-th unordered particle system, and assume that $Z_0^n \to Z_0$ weakly
as $n \to \infty$. Then, $(Z_t^n)_{t \ge 0}$ converges weakly on the path space $D_{\mathcal{M}_1(E)}([0, \infty))$ to the
(Ξ, B)-Fleming-Viot process $(Z_t)_{t \ge 0}$.

The Markov process $(Z_t)_{t \ge 0}$ is characterized by its generator as follows.

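Proposition 1.3. The (Ξ, B)-Fleming-Viot process $(Z_t)_{t \ge 0}$ is a strong Markov process. Its
generator, denoted by L, acts on test functions of the form

$$G_f(\mu) := \int_{E^n} f(x_1, \dots, x_n)\, \mu^{\otimes n}(dx_1, \dots, dx_n), \qquad \mu \in \mathcal{M}_1(E), \qquad (1.9)$$

where $f : E^n \to \mathbb{R}$ is bounded and measurable, via

$$L G_f(\mu) := L^{a\delta_0} G_f(\mu) + L^{\Xi_0} G_f(\mu) + L^{B} G_f(\mu), \qquad (1.10)$$

where

$$L^{a\delta_0} G_f(\mu) := a \sum_{1 \le i < j \le n} \int_{E^n} \big( f(x_1, \dots, x_i, \dots, x_i, \dots, x_n) - f(x_1, \dots, x_j, \dots, x_n) \big)\, \mu^{\otimes n}(dx), \qquad (1.11)$$

$$L^{\Xi_0} G_f(\mu) := \int_{\Delta} \int_{E^{\mathbb{N}}} \Big[ G_f\Big( (1 - |\zeta|)\mu + \sum_{i=1}^{\infty} \zeta_i \delta_{x_i} \Big) - G_f(\mu) \Big]\, \mu^{\otimes \mathbb{N}}(dx)\, \frac{\Xi_0(d\zeta)}{(\zeta, \zeta)}, \qquad (1.12)$$

$$L^{B} G_f(\mu) := r \sum_{i=1}^{n} \int_{E^n} B_i\big(f(x_1, \dots, x_n)\big)\, \mu^{\otimes n}(dx), \qquad (1.13)$$

and $B_i f$ is the mutation operator B, defined in (1.6), acting on the i-th coordinate of f.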

Remark 1.4. 1) In [DK99], Donnelly & Kurtz established a construction and pathwise duality
for the Λ-Fleming-Viot process. In some sense, their paper works under the general assumption
"allow simultaneous and/or multiple births and deaths, but we assume that all the births that
happen simultaneously come from the same parent" (p. 166), even though they very briefly
mention in Section 2.5 a possible extension to scenarios with simultaneous multiple births to
multiple parents. In essence, the present paper converts these ideas into theorems.

2) Note that in a similar direction, Bertoin & Le Gall remark briefly on p. 277 of [BLG03]
how their construction of the Λ-Fleming-Viot process via flows of bridges can be extended to
the simultaneous multiple merger context (but leave details to the interested reader). We are
not following this approach, as it is hard to combine with a general type space and general
mutation process.

3) The Ξ-Fleming-Viot process has recently been independently constructed by Taylor and
Veber (personal communication, 2008) via Bertoin and Le Gall's flow of bridges (see [BLGOS])
and Kurtz and Rodriguez' Poisson representation of measure-valued branching processes (see
[KR08]). In this context we refer to Taylor and Veber [TV08] for a larger study of structured
populations, in which Ξ-coalescents appear under certain limiting scenarios.

4) Note that the modified lookdown construction of the Λ-Fleming-Viot process contains all
information available about the genealogy of the process and therefore also provides a pathwise
embedding of the Λ-coalescent measure tree considered by Greven, Pfaffelhuber and Winter
[GPW07]. A similar statement holds for the Ξ-coalescent.

The rest of the paper is organised as follows: In Section 2 we use the Poisson point process
$\mathfrak{M}^{\Xi_0}$ to introduce the finite unordered (Ξ, B)-Moran model $Y^N$ and the finite ordered
(Ξ, B)-lookdown model $X^N$. It is shown that the ordered model is constructed in such a way that we
can let N tend to infinity and obtain a well defined limit. We will also show that the reordering
preserves the exchangeability property, which will be crucial for the proof in Section 3. In that
section, we will introduce the empirical measures of the processes $Y^N$ and $X^N$, show that they are
identical and converge to a limiting process having nice path properties, which is the statement
of Theorem 1.1.

Section 4.2 will be concerned with the generator of the $\Xi_0$-Fleming-Viot process. We will
give two alternative representations and show that it generates a strongly continuous Feller
semigroup. Furthermore, we will show that the process constructed in Section 3 solves the
martingale problem for this generator.

One representation of the generator will then be used in Section 5 to establish a functional
duality between the Ξ-coalescent and the Ξ-Fleming-Viot process on the genealogical level. Due
to the Poissonian construction, this duality can also be extended to a "pathwise" duality. We
will also give a function-valued dual, which incorporates mutation.

In Section 6 we look at two examples: The first example is concerned with a population
model with recurrent bottlenecks. Here, a particular Ξ-coalescent, which is a subordination of
Kingman's coalescent, arises as a natural limit of the genealogical process. The second example
discusses the Poisson-Dirichlet coalescent and obtains explicit expressions for some quantities
of interest.

2 Exchangeable $E^\infty$-valued particle systems

2.1 The canonical (Ξ, B)-Moran model

We can use the Poisson process from Section 1.4 governing the Ξ-coalescent to describe a
corresponding forward population model in a canonical way, simply reversing the construction
of the coalescent by interpreting the merging events as birth events.
Consider the points

$$(t_m, \zeta_m, u_m) = \big(t_m, (\zeta_{m1}, \zeta_{m2}, \dots), (u_{m1}, u_{m2}, \dots)\big) \qquad (2.1)$$

of $\mathfrak{M}^{\Xi_0}$ defined by (1.2). The $t_m$ denote the times of reproduction events. Define

$$g(\zeta, u) := \begin{cases} \min\{ j : \zeta_1 + \cdots + \zeta_j \ge u \} & \text{if } u \le |\zeta|, \\ \infty & \text{else.} \end{cases} \qquad (2.2)$$

At time $t_m$, the N particles are grouped according to the values $g(\zeta_m, u_{ml})$, $l = 1, \dots, N$, as
follows: For each $k \in \mathbb{N}$, all particles $l \in \{1, \dots, N\}$ with $g(\zeta_m, u_{ml}) = k$ form a family. Among
each non-trivial family we uniformly pick a 'parent' and change the others' types accordingly.
Note that although the jump times $(t_m)$ may be dense in $\mathbb{R}_+$, the condition

$$\int_{\Delta} (\zeta, \zeta)\, \frac{\Xi_0(d\zeta)}{(\zeta, \zeta)} = \Xi_0(\Delta) < \infty$$

guarantees that in a finite population, in each finite time interval only finitely many non-trivial
reproduction events occur. As above, each particle follows an independent mutation process,
according to (1.6), in between reproductive events.
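The grouping of particles into families via the "uniform coins" can be sketched as follows. The sketch assumes that g(ζ, u) returns the index of the first partial sum of ζ that reaches u (and infinity if u exceeds the total mass), and it works with a finite list of non-zero entries of ζ; the helper names `g` and `families` and the data layout are illustrative choices.

```python
import math
from collections import defaultdict


def g(zeta, u):
    """Index j of the first partial sum zeta_1 + ... + zeta_j >= u, else inf."""
    total = 0.0
    for j, z in enumerate(zeta, start=1):
        total += z
        if total >= u:
            return j
    return math.inf


def families(zeta, coins):
    """Group particles 1, ..., N by the value of g(zeta, u_l).

    Particles whose coin falls beyond |zeta| take part in no family.
    """
    groups = defaultdict(list)
    for particle, u in enumerate(coins, start=1):
        k = g(zeta, u)
        if k != math.inf:
            groups[k].append(particle)
    return dict(groups)


# Five particles, family weights 0.5 and 0.3; particle 3 joins no family.
example = families([0.5, 0.3], [0.1, 0.6, 0.95, 0.4, 0.7])
```

Families with a single member are trivial in the sense of the text: no type changes, although such events still matter when matching rates with the coalescent.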

We describe the population corresponding to the N-particle (Ξ, B)-Moran model at time
$t \ge 0$ by a random vector

$$Y^N(t) := \big(Y_1^N(t), \dots, Y_N^N(t)\big) \qquad (2.3)$$

taking values in $E^N$.

Remark 2.1. Note that this model is completely symmetric, thus, for each t, the population
vector $Y^N(t)$ is exchangeable if $Y^N(0)$ is exchangeable.



2.2 The ordered model and exchangeability 

We now define an ordered population model with the same family size distribution, extending the
ideas of Donnelly and Kurtz [DK99] in an obvious way. This time each particle will be assigned
a "level" from $\{1, 2, \dots\}$ in such a way that we obtain a nested coupling of approximating
(Ξ, B)-Moran models as N tends to infinity. It will be crucial to show that this ordered model
retains initial exchangeability, so that the limit as $N \to \infty$ of the empirical measures of the
particle systems, at each fixed time, exists by De Finetti's theorem.

We will refer to this model as the (Ξ, B)-lookdown-model. If the population size is N,
it will be described at time t by the $E^N$-valued random vector

$$X^N(t) := \big(X_1^N(t), \dots, X_N^N(t)\big). \qquad (2.4)$$

The dynamics works as in the (Ξ, B)-Moran model above, including the distribution of family
sizes and the mutation processes for each particle.

In each reproduction step, for each family, a "parental" particle will be chosen, that then
superimposes its type upon its family. This time, however, the parental particle will not be
chosen uniformly among the members of each family (as in the (Ξ, B)-Moran model). Instead,
the parental particle will always be the particle with the lowest level among the members of a
family (hence each family member "looks down" to their relative with the lowest level). The
attachment of types to levels is then rearranged as follows (see Figure 1 for an illustration):

a) All parental particles of all families (including the trivial ones) will retain their type and
level.

b) All levels of members of families will assume the type of their respective parental particle.

c) All levels which are still vacant will assume the pre-reproduction types of non-parental
particles, retaining their initial order. Once all N levels are filled, the remaining types will
be lost.

Figure 1: An illustration of the reproduction mechanism in the (Ξ, B)-lookdown model. Panels:
(a) parental particles retain type and level; (b) family members copy the type of their parental
particle; (c) remaining particles retain their order and surplus particles get killed. The
particles at levels 2 and 5 belong to the "star" family, whereas the particles at levels 3, 6 and 8
belong to the "triangle" family. The particles on the remaining levels belong to no family.
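The rearrangement rules a) to c) translate directly into a short routine. The following sketch is an illustration only; the function name `lookdown_step`, the 1-based level convention and the encoding of families as sets of levels are assumptions made here. Mutation between events is not modelled.

```python
def lookdown_step(types, families):
    """Apply one lookdown reproduction event (illustrative sketch).

    types:    list of pre-reproduction types; types[l - 1] sits at level l
    families: list of disjoint sets of levels; min(family) is the parent
    """
    n = len(types)
    parents = {min(fam) for fam in families}
    new_types = {}
    # a) and b): every family level carries the type of the family's parent.
    for fam in families:
        parent_type = types[min(fam) - 1]
        for level in fam:
            new_types[level] = parent_type
    # c): vacant levels are filled, in order, with the pre-reproduction
    # types of the non-parental particles; surplus types are lost.
    movers = [types[l - 1] for l in range(1, n + 1) if l not in parents]
    vacant = [l for l in range(1, n + 1) if l not in new_types]
    for level, t in zip(vacant, movers):
        new_types[level] = t
    return [new_types[l] for l in range(1, n + 1)]


# N = 8 with families {2, 5} and {3, 6, 8}, as in the illustration.
after = lookdown_step(list("abcdefgh"), [{2, 5}, {3, 6, 8}])
```

In this run the types previously at levels 6, 7 and 8 are lost, while the type from level 5 moves up to level 7.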

In this way, the dynamics of a particle, at level l, say, will only depend on the dynamics of
the particles with lower levels. This consistency property allows us to construct all approximating
particle systems, as well as their limit as $N \to \infty$, on the same probability space.

Exchangeability of the modified (Ξ, B)-lookdown model is crucial in order to pass to the De
Finetti limit of the associated empirical particle systems. For each N, we will show that if $X^N(0)$
is exchangeable, then $X^N$ is exchangeable at fixed times and at stopping times. The proof will
rely on an explicit construction of uniform random permutations $\Theta(t)$ which map $X^N$ to $Y^N$.

Theorem 2.2. If the initial distribution of the population vector $(X_1^N(0), \dots, X_N^N(0))$ in the
(Ξ, B)-lookdown-model is exchangeable, then $(X_1^N(t), \dots, X_N^N(t))$ is exchangeable for each $t \ge 0$.

For the rest of this section, we omit the superscript N for the population models in an
attempt not to get lost in notation.

The proof of Theorem 2.2 follows that of Theorem 3.2 in [DK99]. We will construct a
coupling via a permutation-valued process $\Theta(t)$ such that

$$(Y_1(t), \dots, Y_N(t)) = \big(X_{\Theta_1(t)}(t), \dots, X_{\Theta_N(t)}(t)\big) \qquad (2.5)$$

and $\Theta(t)$ is uniformly distributed on all permutations of $\{1, \dots, N\}$ for each t and independent
of the empirical process up to time t and the "demographic information" in the model (see
(2.15) for a precise definition).

It suffices to construct the skeleton chain $(\theta_m)_{m \in \mathbb{N}_0}$ of $\Theta$. As a guide through the following
notation, we have found it useful to occasionally remember that $\Theta(t)$ (and its skeleton chain)
is built to the following aim:

    $\Theta(t)$ maps a position of an individual in the vector Y ((Ξ, B)-Moran-model) to the
    level of the corresponding individual in the ordered vector X
    ((Ξ, B)-lookdown-model).

Notation and ingredients. For $N \in \mathbb{N}$ let $S_N$ denote the collection of all permutations of
$\{1, \dots, N\}$, let $P_N = \mathcal{P}(\{1, \dots, N\})$, the set of all subsets of $\{1, \dots, N\}$, and let $P_{N,k} \subset P_N$ be
the subcollection of subsets with cardinality k. For a set M, $M(i)$ will denote the i-th smallest
element in M (this is the convention used in Example 2.3, where the third element of $\{3, 6, 8\}$ is 8).

At time m (for the skeleton chain) let $c_m$ be the total number of children. Let $a_m$ be the
number of families and $c_m^i$ the number of children born to family i, hence

$$\sum_{i=1}^{a_m} c_m^i = c_m. \qquad (2.6)$$

Note that we allow $c_m^i = 0$ for some, but not all i. These are the trivial families where only
the parental particle is below level N and all potential children are above. Furthermore, we
need to keep track of these "one-member families" in order to match the rates of our model to
those of the Ξ-coalescent later on.

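Let $\theta_0$ be uniformly distributed over $S_N$. For each $m \in \mathbb{N}$, pick (independently, and
independent of $\theta_0$)

• $\Phi_m$, a random set, uniformly chosen from $P_{N, c_m + a_m}$,

• $(\phi_m^1, \dots, \phi_m^{a_m})$, a random (ordered) partition of $\Phi_m$ such that each $\phi_m^i$ has size $c_m^i + 1$,

• $\sigma_m^i$, $i = 1, \dots, a_m$, random permutations, each $\sigma_m^i$ uniformly distributed over $S_{c_m^i + 1}$,
independently of $\Phi_m$ and the $\phi_m^i$.

Denote

• $\mu_m^i := \min(\phi_m^i)$, $i \in \{1, \dots, a_m\}$, and

• write $\Delta_m$ for the set of the highest $c_m$ integers from $\{1, \dots, N\} \setminus \bigcup_{i=1}^{a_m} \{\mu_m^i\}$.

Proceeding inductively we assume that $\theta_{m-1}$ has already been defined. We then construct
$\theta_m$ as follows: Let

• $\nu_m^i := \theta_{m-1}^{-1}(\mu_m^i)$, $i \in \{1, \dots, a_m\}$,

• $\psi_m := \theta_{m-1}^{-1}(\Delta_m)$, and

• $(\psi_m^1, \dots, \psi_m^{a_m})$ be a random ordered "partition" of $\psi_m$ such that $|\psi_m^i| = c_m^i$, chosen
independently of everything else.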

In view of our intended application of $\theta_m$ to transfer from the Moran model to the lookdown
model, we will later on interpret these quantities as follows: In the m-th event, $\mu_m^i$ will be the
level of the parental particle of family i in the lookdown-model, and $\nu_m^i$ will be the corresponding
index in the (unordered) Moran model. $\Delta_m$ will specify the levels in the lookdown-model
at which individuals die. We do not just pick the highest $c_m$ levels, because we wish to retain
parental particles. $\psi_m$ will be the corresponding indices in the Moran model, $(\phi_m^1, \dots, \phi_m^{a_m})$
describes the family decomposition (including the respective parents) in this event in the lookdown
model, and $\psi_m^i$ are the indices of the children in the i-th family in the Moran model.
Thus, $\theta_m$ will map $\psi_m^i \cup \{\nu_m^i\}$ onto $\phi_m^i$ (in a particular order).

Finally, define $\theta_m$ as follows: Put $\Psi_m := \{\nu_m^1, \dots, \nu_m^{a_m}\} \cup \psi_m$. On $\Psi_m$,

$$\theta_m(\nu_m^i) := \phi_m^i(\sigma_m^i(1)), \qquad i = 1, \dots, a_m, \qquad (2.7)$$

and

$$\theta_m(\psi_m^i(j)) := \phi_m^i(\sigma_m^i(j + 1)) \qquad \forall j \in \{1, \dots, c_m^i\} \qquad (2.8)$$

for each $i \in \{1, \dots, a_m\}$ with $c_m^i \ne 0$. On $\{1, \dots, N\} \setminus \Psi_m$ let $\theta_m$ be the mapping onto
$\{1, \dots, N\} \setminus \Phi_m$ with the same order as $\theta_{m-1}$ restricted to $\{1, \dots, N\} \setminus \Psi_m$, that is, whenever
$\theta_{m-1}(i) < \theta_{m-1}(j)$ for some $i, j \in \{1, \dots, N\} \setminus \Psi_m$, then $\theta_m(i) < \theta_m(j)$ should also hold.

Figure 2: The construction of the new permutation from the old permutation carried out in
Example 2.3. Each panel lists the images of the indices 1, ..., 8; a dot marks a value not yet assigned.

    index i                                   1 2 3 4 5 6 7 8
    (a) Initial permutation $\theta_{m-1}$    2 5 8 3 7 1 6 4
    (b) The families are added                5 . 3 8 2 . 6 .
    (c) The completed permutation $\theta_m$  5 7 3 8 2 1 6 4

Example 2.3. We consider a realisation of the m-th event of a population of size N = 8,
as illustrated in Figure 1. There are $a_m = 2$ families (depicted by "triangle" and "star",
respectively). The first family $\phi_m^1 = \{3, 6, 8\}$ has size $c_m^1 + 1 = 3$, the second, $\phi_m^2 = \{2, 5\}$,
has size $c_m^2 + 1 = 2$. Hence, the set of levels involved in this birth event is $\Phi_m = \{2, 3, 5, 6, 8\}$,
and $\mu_m^1 = 3$, $\mu_m^2 = 2$ are the levels of the parental particles. Since there is no parental particle
among the highest three levels, the particles at levels $\Delta_m = \{6, 7, 8\}$ "die".



Now let us assume that $\theta_{m-1}$ is as given in Figure 2(a). Thus, $\nu_m^1 = 4$, $\nu_m^2 = 1$, $\psi_m =
\{3, 5, 7\}$. The set of indices $\psi_m$ of individuals in the Moran model who will get replaced by
offspring in this event is partitioned according to the family sizes, for example let $\psi_m^1 = \{3, 7\}$
and $\psi_m^2 = \{5\}$.

We construct $\theta_m$ as follows: Let $\sigma_m^1 = \binom{1\ 2\ 3}{3\ 1\ 2}$ and $\sigma_m^2 = \binom{1\ 2}{2\ 1}$. For the restriction of $\theta_m$
to $\Psi_m = \{1, 3, 4, 5, 7\}$, we read from (2.7) that $\theta_m(4) = \phi_m^1(3) = 8$, $\theta_m(1) = \phi_m^2(2) = 5$ and
from (2.8) that $\theta_m(3) = \theta_m(\psi_m^1(1)) = \phi_m^1(\sigma_m^1(1 + 1)) = \phi_m^1(1) = 3$, $\theta_m(7) = \theta_m(\psi_m^1(2)) =
\phi_m^1(\sigma_m^1(2 + 1)) = \phi_m^1(2) = 6$ and $\theta_m(5) = \theta_m(\psi_m^2(1)) = \phi_m^2(\sigma_m^2(1 + 1)) = \phi_m^2(1) = 2$. This leads
to the partial permutation which is given in Figure 2(b).



Restricted to the complementary set $\{2, 6, 8\}$, $\theta_m$ is a mapping onto $\{1, 4, 7\}$ with the same
order as $\theta_{m-1}$ restricted to $\{2, 6, 8\}$. The resulting permutation $\theta_m$ is given in Figure 2(c).
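The update from the old to the new permutation in this example can be checked mechanically. The sketch below implements one step of the skeleton chain under the following reading of the construction: the parent index of family i is sent to the $\sigma^i(1)$-th smallest element of the family's level set, the j-th child index to the $\sigma^i(j+1)$-th smallest element, and all remaining indices are mapped order-preservingly onto the remaining levels. The function name `next_theta` and the dictionary encoding are illustrative assumptions; the child index sets are taken as given rather than derived from the death levels.

```python
def next_theta(theta_prev, phis, psis, sigmas):
    """One step of the permutation chain (illustrative sketch).

    theta_prev: dict mapping Moran index -> lookdown level (1-based)
    phis:       list of level sets phi^i of the families
    psis:       list of child index sets psi^i (Moran indices)
    sigmas:     list of tuples (sigma^i(1), ..., sigma^i(c^i + 1))
    """
    n = len(theta_prev)
    inverse = {level: index for index, level in theta_prev.items()}
    theta = {}
    for phi, psi, sigma in zip(phis, psis, sigmas):
        phi, psi = sorted(phi), sorted(psi)
        nu = inverse[phi[0]]                  # Moran index of the parent
        theta[nu] = phi[sigma[0] - 1]         # parent index, via sigma(1)
        for j, index in enumerate(psi, start=1):
            theta[index] = phi[sigma[j] - 1]  # j-th child, via sigma(j + 1)
    # Remaining indices keep the relative order they had under theta_prev.
    rest_indices = sorted((i for i in theta_prev if i not in theta),
                          key=lambda i: theta_prev[i])
    rest_levels = sorted(set(range(1, n + 1)) - set(theta.values()))
    theta.update(zip(rest_indices, rest_levels))
    return theta


theta_prev = dict(zip(range(1, 9), [2, 5, 8, 3, 7, 1, 6, 4]))
theta_new = next_theta(theta_prev,
                       phis=[{3, 6, 8}, {2, 5}],
                       psis=[{3, 7}, {5}],
                       sigmas=[(3, 1, 2), (2, 1)])
```

Reading off `theta_new` at the indices 1, ..., 8 reproduces the completed permutation of the example.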
For notational convenience, let

$$\chi_m := \big( (\nu_m^1, \psi_m^1), \dots, (\nu_m^{a_m}, \psi_m^{a_m}) \big), \qquad (2.9)$$

which summarises the combinatorial information generated in the m-th step (namely, the family
structure we would observe in the Moran model).

Lemma 2.4. For each $m$, $\chi_1, \dots, \chi_m, \theta_m$ are independent. Furthermore, $\theta_m$ is uniformly distributed over $S_N$ and

Tm:=[j{i^L}Ui^'m (2.10) 
1=1 

is uniformly distributed over $P_{N, c_m + a_m}$, and each $\chi_m$ is, given $T_m$, uniformly distributed on all ordered partitions of $T_m$ with family sizes consistent with the $c_m^i$.
Proof. We prove the statement by induction. Denoting

rm Xm mi Xn 



(^{SkiXk : < /c < m), we have 

1 ern~ll (2.11) 



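since $\theta_m$ and $\chi_m$ are only based on $\theta_{m-1}$ and additional independent random structure. This implies, for any choice of $f : S_N \to \mathbb{R}$ and $h_k : \bigcup_{a} \big(\{1, \dots, N\} \times \mathcal{P}(\{1, \dots, N\})\big)^a \to \mathbb{R}$,

$$\begin{aligned}
\mathbb{E}\Big[ f(\theta_m) \prod_{k=1}^{m} h_k(\chi_k) \Big]
&= \mathbb{E}\Big[ \mathbb{E}\big[ f(\theta_m) h_m(\chi_m) \,\big|\, \sigma(\theta_k, \chi_k : k < m) \big] \prod_{k=1}^{m-1} h_k(\chi_k) \Big] \\
&= \mathbb{E}\Big[ \mathbb{E}\big[ f(\theta_m) h_m(\chi_m) \,\big|\, \theta_{m-1} \big] \prod_{k=1}^{m-1} h_k(\chi_k) \Big] \\
&= \mathbb{E}\big[ f(\theta_m) h_m(\chi_m) \big] \prod_{k=1}^{m-1} \mathbb{E}\big[ h_k(\chi_k) \big],
\end{aligned}$$

where we used (2.11) in the second and the induction hypothesis in the third equality. It remains to show that $\theta_m$ and $\chi_m$ are independent and have the correct distributions.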

$\theta_{m-1}$ is uniformly distributed by the induction hypothesis and independent of the distributions of the parental levels and the "death levels" $\Delta_m$ by construction. It is immediate from the construction that $\Phi_m$ and $T_m$ are uniformly distributed over $P_{N, c_m + a_m}$ and the family structure $\chi_m$ is uniformly distributed among all admissible configurations.

Furthermore, conditioning on $\chi_m$ and $\Phi_m$, $\theta_m$ is uniformly distributed over all permutations that map $T_m$ onto $\Phi_m$. This follows from the fact that $\Phi_m$ is uniform on $P_{N, c_m + a_m}$ and that this set is uniformly divided into the families $\phi_m^i$. Since uniform and independent permutations $\sigma_m^i$ are used for the construction of $\theta_m$ and the non-participating levels remain uniformly distributed, $\theta_m$ is uniform under these conditions.

Finally, conditioning on $\chi_m$ does not alter the fact that $\Phi_m$ is uniformly distributed over $P_{N, c_m + a_m}$. This implies that given $\chi_m$, $\theta_m$ is also uniformly distributed over $S_N$. Since

$$\mathcal{L}(\theta_m \mid \chi_m) = \mathrm{Unif}(S_N) = \mathcal{L}(\theta_m), \tag{2.12}$$

$\theta_m$ and $\chi_m$ are independent of each other. □



Proof of Theorem 2.2. Suppose a realization $X$ of the $N$-particle $(\Xi, B)$-lookdown-model is given and let $\{t_m\}$ denote the times at which the birth events occur. The families involved in the $m$-th birth event are denoted by $\phi_m^i$. Note that by definition of the lookdown dynamics, the "ingredients" $\Phi_m, c_m, a_m, c_m^i, \mu_m^i, \Delta_m$ introduced earlier can be obtained from this, and that their joint distribution is as discussed above.

Moreover, let the initial permutation $\theta_0$ be independent of $X$ and uniformly distributed on $S_N$. Let $\sigma_m^i$ be independent of all other random variables and uniformly distributed on $S_{c_m^i + 1}$, $1 \le i \le a_m$, $m \in \mathbb{N}$.

Define $\theta_m$ as above, and

$$\Theta(t) := \theta_m \quad \text{for } t_m \le t < t_{m+1}. \tag{2.13}$$

Observe that, by Lemma 2.4,

$$(Y_1(t), \dots, Y_N(t)) := \big(X_{\Theta(t)(1)}(t), \dots, X_{\Theta(t)(N)}(t)\big) \tag{2.14}$$

is a version of the $(\Xi, B)$-Moran-model. Note that "one-member families" are in this construction simply treated as non-participating individuals in the $(\Xi, B)$-Moran-model.

$Y(t)$ depends only on $Y(0)$, $\{\chi_m\}_{t_m \le t}$ and the evolution of the type processes between birth and death events, so $\Theta(t)$, and hence $\Theta(t)^{-1}$, is independent of

$$\mathcal{G}_t := \sigma\big((Y_1(s), \dots, Y_N(s)) : s \le t\big) \vee \sigma(\chi_m : m \in \mathbb{N}) \tag{2.15}$$

due to Lemma 2.4. Therefore, we see from

$$(X_1(t), \dots, X_N(t)) = \big(Y_{\Theta(t)^{-1}(1)}(t), \dots, Y_{\Theta(t)^{-1}(N)}(t)\big) \tag{2.16}$$

that $(X_1(t), \dots, X_N(t))$ is exchangeable. □

Corollary 2.5. Starting from the same exchangeable initial condition, the laws of the empirical processes of the $(\Xi, B)$-Moran-model and the $(\Xi, B)$-lookdown-model coincide.

The exchangeability property does not only hold for fixed times, but also for stopping times.

Theorem 2.6. Suppose that the initial population vectors $Y^N(0)$ in the $(\Xi, B)$-Moran-model and $X^N(0)$ in the $(\Xi, B)$-lookdown-model have the same exchangeable distribution, and let $\tau$ be a stopping time with respect to $(\mathcal{G}_t)_{t \ge 0}$ given by (2.15). Then, $(X_1^N(\tau), \dots, X_N^N(\tau))$ is exchangeable.

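Proof. We show that $\Theta(\tau)$ is independent of the $\sigma$-algebra $\mathcal{G}_\tau$ (the $\tau$-past) and uniformly distributed over $S_N$.

First, assume that $\tau$ takes only countably many values $t_k$, $k \in \mathbb{N}$. Let $A \in \mathcal{G}_\tau$ and $h : S_N \to \mathbb{R}_+$, then

$$\begin{aligned}
\mathbb{E}\big(h(\Theta(\tau)) \mathbf{1}_A\big)
&= \mathbb{E}\Big( \sum_{k=1}^{\infty} h(\Theta(t_k)) \mathbf{1}_{A \cap \{\tau = t_k\}} \Big) \\
&= \sum_{k=1}^{\infty} \big(\mathbb{E}\, h(\Theta(t_k))\big) \big(\mathbb{E}\, \mathbf{1}_{A \cap \{\tau = t_k\}}\big) \\
&= \int h(\theta)\, \mathcal{U}(d\theta) \sum_{k=1}^{\infty} \mathbb{E}\, \mathbf{1}_{A \cap \{\tau = t_k\}} \\
&= \int h(\theta)\, \mathcal{U}(d\theta)\; \mathbb{E}\, \mathbf{1}_A,
\end{aligned} \tag{2.17}$$

where $\mathcal{U}$ denotes the uniform distribution on $S_N$. To see that the second equality holds, observe that, for fixed $t_k$, $\Theta(t_k)$ is independent of $\mathcal{G}_{t_k}$, as shown in the proof of Theorem 2.2.

By approximating an arbitrary stopping time from above by a sequence of discrete stopping times, we see that (2.17) holds in the general case as well. Now, exchangeability of $(X_1^N(\tau), \dots, X_N^N(\tau))$ follows as in the proof of Theorem 2.2. □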

Remark 2.7. One can also define a variant of the $(\Xi, B)$-lookdown model which is more in the spirit of the 'classical' lookdown construction from [DK96], where, instead of a)-c) above, at a jump time each particle simply copies the type of that member of the family it belongs to with the lowest level (and no types get shifted upwards). This variant, which is (up to a renaming of levels by the points of a Poisson process on $\mathbb{R}$) also the one suggested by adapting [KR08] to the 'simultaneous multiple merger' scenario, has been considered by Taylor & Veber (2008, personal communication). The same results as above hold for this variant, with only minor modifications of the proofs. Note that the flavour of the lookdown process described above is easily adaptable to a set-up with time-varying total population size, which is not obvious for the other variant.
2.3 The limiting population

We now construct the limiting $E^\infty$-valued particle system $X = (X_1, X_2, \dots)$ by formulating a stochastic differential equation for each level $l$. These exist for each level and are well defined, since the equation for level $l$ needs only information about lower levels.

The generator (1.6) of a pure jump process can be written in the form

$$Bf(x) = r \int_0^1 \big(f(m(x,u)) - f(x)\big)\, du,$$

where $r$ is the global mutation rate and $m : E \times [0,1] \to E$ transforms a uniformly distributed random variable on $[0,1]$ into the jump distribution $q(x, dy)$ of the process. The random times and uniform "coins" for the mutation process at each level $l$ are given by a Poisson point process $\mathfrak{M}_l^{\mathrm{Mut}}$ on $[0, \infty) \times [0,1]$ with intensity measure $r\, dt \otimes du$.
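The mutation mechanism at a single level can be sketched in a few lines. The following is only an illustration under stated assumptions: the function names `simulate_mutation` and `flip`, the two-allele type space and the flip probability 0.3 are hypothetical and do not come from the paper. The points of a rate-$r$ Poisson process are generated through exponential gaps, and each point carries a uniform coin $u$ that is pushed through $m(x,u)$.

```python
import random

def simulate_mutation(x0, m, r, t_end, rng):
    """Follow one level up to time t_end; return (jump times, visited states)."""
    t, x = 0.0, x0
    times, states = [0.0], [x0]
    while True:
        t += rng.expovariate(r)   # gaps between points of a rate-r Poisson process
        if t > t_end:
            return times, states
        u = rng.random()          # the uniform "coin" attached to this point
        x = m(x, u)               # jump: x -> m(x, u)
        times.append(t)
        states.append(x)

# Hypothetical two-allele example on E = {0, 1}: at each point the type
# flips with probability 0.3 and stays put otherwise.
def flip(x, u):
    return 1 - x if u < 0.3 else x

times, states = simulate_mutation(0, flip, r=2.0, t_end=5.0, rng=random.Random(1))
```

Note that a point of the Poisson process need not change the type: whenever $m(x,u) = x$ the "jump" is invisible, which is why $r$ is a global rate rather than the actual jump rate.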

As in Section 2.1 denote by

$$(t_m, \zeta_m, u_m) = \big(t_m, (\zeta_{m1}, \zeta_{m2}, \dots), (u_{m1}, u_{m2}, \dots)\big)$$

the points of the Poisson point process $\mathfrak{M}^{\Xi_0}$ and recall the definition (2.2) of the "colour" function $g$. Based on this, define

$$L_J^l(t) := \sum_{m : t_m \le t} \; \prod_{j \in J} \mathbf{1}_{\{g(\zeta_m, u_{mj}) < \infty\}} \prod_{j \in \{1, \dots, l\} \setminus J} \mathbf{1}_{\{g(\zeta_m, u_{mj}) = \infty\}}, \tag{2.18}$$

for $J \subset \{1, \dots, l\}$ with $|J| \ge 2$. $L_J^l(t)$ counts how many times, among the levels in $\{1, \dots, l\}$, exactly those in $J$ were involved in a birth event up to time $t$. Moreover, let

$$L_{J,k}^l(t) := \sum_{m : t_m \le t} \; \prod_{j \in J} \mathbf{1}_{\{g(\zeta_m, u_{mj}) = k\}} \prod_{j \in \{1, \dots, l\} \setminus J} \mathbf{1}_{\{g(\zeta_m, u_{mj}) \neq k\}}. \tag{2.19}$$

$L_{J,k}^l(t)$ counts how many times, among the levels in $\{1, \dots, l\}$, exactly those in $J$ were involved in a birth event up to time $t$ and additionally assumed "colour" $k$.
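To make the role of the colours concrete, here is a small sketch of how one point $(t_m, \zeta_m, u_m)$ sorts the first $l$ levels into families. The definition (2.2) of $g$ lies outside this section, so the code assumes the usual paintbox form: $g(\zeta, u)$ is the index $j$ of the first partial sum $\zeta_1 + \dots + \zeta_j$ that reaches $u$, and $\infty$ if $u$ exceeds $|\zeta|$. The names `colour` and `families` and the numerical values are hypothetical.

```python
import math
from itertools import accumulate

def colour(zeta, u):
    """Assumed form of g: index of the paintbox interval containing u, else inf."""
    for j, s in enumerate(accumulate(zeta), start=1):
        if u <= s:
            return j
    return math.inf

def families(zeta, coins):
    """Group levels 1..l (l = len(coins)) by finite colour; colour inf is dropped."""
    fam = {}
    for level, u in enumerate(coins, start=1):
        k = colour(zeta, u)
        if k != math.inf:
            fam.setdefault(k, []).append(level)
    return fam

zeta = [0.5, 0.25]                  # hypothetical mass partition, |zeta| = 0.75
coins = [0.9, 0.3, 0.6, 0.1, 0.8]   # hypothetical uniform coins for levels 1..5
fam = families(zeta, coins)         # levels 2 and 4 share colour 1, level 3 has colour 2
```

In the notation above, this single event would increase $L^5_J$ for $J = \{2,3,4\}$ and $L^5_{J,1}$ for $J = \{2,4\}$ by one.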

To specify the new levels of the individuals not participating in a certain birth event, we construct a function $J_m$ as follows:

Denote by $\mu_m^k := \min\{l \in \mathbb{N} \mid g(\zeta_m, u_{ml}) = k\}$ the level of the parental particle of family number $k$ and by $M_m := \{\mu_m^k\}_{k \in \mathbb{N}}$ the set of all levels of parental particles involved in the $m$-th birth event. Furthermore, $U_m := \{l \in \mathbb{N} \mid g(\zeta_m, u_{ml}) = \infty\}$ denotes the set of the levels not participating in the birth event $m$. Define the mapping

$$J_m : U_m \to \mathbb{N} \setminus M_m \tag{2.20}$$

that maps the $i$-th smallest element of the set $U_m$ to the $i$-th smallest element of the set $\mathbb{N} \setminus M_m$ for all $i$.
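A finite truncation of $J_m$ can be sketched as follows. The function name `lookdown_sources` and its arguments are hypothetical; the code simply pairs the $i$-th smallest non-participating level with the $i$-th smallest non-parental level, restricted to the first `n_levels` levels.

```python
def lookdown_sources(n_levels, parents, nonparticipating):
    """Return {l: J_m(l)} for the levels l <= n_levels in U_m.

    parents          -- the set M_m of parental levels
    nonparticipating -- the levels of U_m up to n_levels
    """
    targets = sorted(nonparticipating)
    # i-th smallest element of N \ M_m; zip stops once all targets are served
    sources = (l for l in range(1, n_levels + 1) if l not in parents)
    return {t: s for t, s in zip(targets, sources)}

# Event of Example 2.3: families {3, 6, 8} and {2, 5}, hence parents {3, 2}
# and non-participating levels {1, 4, 7} among the first eight levels.
sources = lookdown_sources(8, parents={2, 3}, nonparticipating={1, 4, 7})
```

For the event of Example 2.3 the new level 7 receives the type previously at level 5, while nothing is copied from the old levels 6, 7 and 8, in line with the statement there that these particles "die" in the population of size $N = 8$.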

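Assuming for the moment that $E$ is an Abelian group, the (infinite) vector describing the types in the $(\Xi, B)$-lookdown-model is defined as the (unique) strong solution of the following system of stochastic differential equations. The lowest individual on level 1 just evolves according to mutation, i.e.,

$$X_1(t) := X_1(0) + \int_{[0,t] \times [0,1]} \big(m(X_1(s-), u) - X_1(s-)\big)\, d\mathfrak{M}_1^{\mathrm{Mut}}(s, u). \tag{2.21}$$

The individuals above level one can look down during birth events. Thus, for $l \ge 2$, define

$$\begin{aligned}
X_l(t) := X_l(0) &+ \int_{[0,t] \times [0,1]} \big(m(X_l(s-), u) - X_l(s-)\big)\, d\mathfrak{M}_l^{\mathrm{Mut}}(s, u) \\
&+ \sum_{1 \le i < l} \int_0^t \big(X_i(s-) - X_l(s-)\big)\, d\mathfrak{N}^{\mathrm{K}}_{il}(s) \\
&+ \sum_{1 \le i < j < l} \int_0^t \big(X_{l-1}(s-) - X_l(s-)\big)\, d\mathfrak{N}^{\mathrm{K}}_{ij}(s) \\
&+ \sum_{k \in \mathbb{N}} \; \sum_{K \subset \{1, \dots, l\},\, l \in K} \int_0^t \big(X_{\min(K)}(s-) - X_l(s-)\big)\, dL^l_{K,k}(s) \\
&+ \sum_{K \subset \{1, \dots, l\},\, l \notin K} \int_0^t \big(X_{J_m(l)}(s-) - X_l(s-)\big)\, dL^l_K(s).
\end{aligned} \tag{2.22}$$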

The second and third lines describe the "Kingman events", where only pairs of individuals are involved. The first part copies the type from level $i$ when $l$ looks down to this level, because it is involved in a birth event and the parental particle is at level $i$. The second part handles the event that the parental particle places a child on a level below $l$. In this case, $l$ has to copy the type from the level $l - 1$, since the new individual is inserted at some level below $l$ and pushes all particles above that level one level up.
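The effect of a single Kingman event on the first few levels can be sketched as follows. This is a minimal illustration only: the function name `kingman_event` is hypothetical, levels are 1-based as in the text, and in a finite truncation the type at the top level is simply pushed out.

```python
def kingman_event(types, i, j):
    """Apply a pair event (i < j, 1-based) to the first len(types) levels."""
    assert 1 <= i < j <= len(types)
    new = list(types)
    new[j - 1] = types[i - 1]          # level j looks down to level i
    for l in range(j + 1, len(types) + 1):
        new[l - 1] = types[l - 2]      # level l > j copies the old level l - 1
    return new

after = kingman_event(["a", "b", "c", "d", "e"], i=2, j=3)
```

Levels below $j$ are untouched, level $j$ takes the parental type, and every level above $j$ takes the type of its lower neighbour, which are exactly the two cases distinguished in the paragraph above.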

The fourth and fifth lines describe the change of types for a birth event with large families in a similar way. If the particle at level $l$ is involved in the family $k$, it copies the type from the parental particle which resides at the lowest level of the family. If level $l$ is not involved in any family, then $J_m(l)$ ($\le l$) gives the level from where the type is copied (which comes from shifting particles not involved in the lookdown event upwards).

Since the equation for $X_l$ involves only $X_1, \dots, X_l$ and finitely many Poisson processes, it is immediate that there exists a unique strong solution of (2.21)-(2.22).

In the case where $E$ has no group structure, one may still construct suitable jump-hold processes $X_l$, using the driving Poisson processes in an obvious extension of (2.21)-(2.22).

These stochastic differential equations determine an infinitely large population vector

$$X(t) := (X_1(t), X_2(t), \dots) \tag{2.23}$$

in a consistent way, and for each $N \in \mathbb{N}$, the dynamics of $(X_1, \dots, X_N)$ is identical to that defined in Section 2.2. In particular, we see from Theorem 2.2 that, for each $t \ge 0$, $X(t)$ is exchangeable and the empirical distribution

$$Z(t) := \lim_{l \to \infty} Z^l(t) := \lim_{l \to \infty} \frac{1}{l} \sum_{i=1}^{l} \delta_{X_i(t)} \tag{2.24}$$

exists almost surely. Let $F$ be the set of bounded measurable functions $\varphi : [0, \infty) \times [0,1]^\infty \times [0,1]^\infty \to \mathbb{R}$ such that $\varphi(t, \zeta, u)$ does not depend on $u$, and put

$$\mathcal{H}_t := \sigma\Big( (Z(s) : s \le t), \Big( \int \varphi\, d\mathfrak{M}^{\Xi_0} : \varphi \in F \Big) \Big). \tag{2.25}$$

Corollary 2.8. Let $\tau$ be a stopping time with respect to $(\mathcal{H}_t)_{t \ge 0}$. Then

$$X(\tau) = (X_1(\tau), X_2(\tau), \dots) \tag{2.26}$$

is exchangeable.
Proof. We claim that for $t \ge 0$, $A \in \mathcal{H}_t$ with $\mathbb{P}\{A\} > 0$ and $n \in \mathbb{N}$,

$$(X_1(t), \dots, X_n(t)) \text{ is exchangeable under } \mathbb{P}\{\,\cdot \mid A\}. \tag{2.27}$$

Observe that, taking $A = \{\tau = t_k\}$, (2.27) immediately implies the result for discrete stopping times $\tau$, from which the general case can be deduced by approximation as in the proof of Theorem 2.6.

Obviously, (2.27) is equivalent to

$$\mathbb{P}\big\{A \cap \{(X_1(t), \dots, X_n(t)) \in C\}\big\} = \mathbb{P}\big\{A \cap \{(X_{\sigma(1)}(t), \dots, X_{\sigma(n)}(t)) \in C\}\big\} \quad \forall\, C \subset E^n,\ \sigma \in S_n. \tag{2.28}$$

As the collection of sets $A$ from $\mathcal{H}_t$ satisfying (2.28) is a Dynkin system, it suffices to verify (2.28) for events of the form

$$A = \{Z(s_1) \in B_1, \dots, Z(s_k) \in B_k\} \cap H', \tag{2.29}$$

where $H' \in \sigma\big(\int \varphi\, d\mathfrak{M}^{\Xi_0} : \varphi \in F\big)$, $k \in \mathbb{N}$, $s_1 < \dots < s_k \le t$, $B_i \in \mathcal{B}(s_i)$ for $i \in \{1, \dots, k\}$, and $\mathcal{B}(s_i)$ is a $\cap$-stable generator of the Borel $\sigma$-algebra on $\mathcal{M}_1(E)$ with the property that $\mathbb{P}\{Z(s_i) \in \partial B'\} = 0$ for all $B' \in \mathcal{B}(s_i)$.

For $A$ as given in (2.29), $\varepsilon > 0$ and $n \in \mathbb{N}$, $\sigma \in S_n$, $C \subset E^n$ appearing in (2.28), by (2.24) there exists $l$ ($l \ge n$) such that

$$A_l := \{Z^l(s_1) \in B_1, \dots, Z^l(s_k) \in B_k\} \cap H'$$

satisfies $\mathbb{P}\{(A \setminus A_l) \cup (A_l \setminus A)\} \le \varepsilon$. By the arguments given in the proof of Theorem 2.6, (2.28) holds with $A$ replaced by $A_l$. Finally, take $\varepsilon \to 0$ to conclude. □



3 Pathwise convergence: Proof of Theorem 1.1

Recall the empirical processes $Z^l$ and their limit $Z$ from (2.24). Obviously, for each $l \in \mathbb{N}$, the process $(Z^l(t))_{t \ge 0}$ has càdlàg paths. To verify the corresponding property for $Z$, we introduce the following auxiliary (Lévy) process $U$, derived from the Poisson point process $\mathfrak{M}^{\Xi_0}$ which governs the large family birth events of the population $X$: If $\{(t_m, \zeta_m, u_m)\}$ are the points of the process, we define

$$U(t) := \sum_{m : t_m \le t} v_m^2, \tag{3.1}$$

where $v_m := \sum_{i=1}^{\infty} \zeta_{mi}$. The jumps of $U := (U(t))_{t \ge 0}$ are the squared total fractions of the population which are replaced in large birth events. The generator of $U$ is given by

$$Df(u) = \int_0^1 \big(f(u + v^2) - f(u)\big)\, \nu(dv), \tag{3.2}$$

where the measure $\nu$ on $[0,1]$, defined via

governs the jumps. 

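We need the following version of Lemma A.2 from [DK99].

Lemma 3.1. a) Let $e_1, e_2, \dots$ be exchangeable and suppose there exists a constant $K$ such that $|e_i| \le K$ almost surely. Define

$$M_k := \frac{1}{k} \sum_{i=1}^{k} e_i \tag{3.4}$$

and let $M_\infty$ be the almost sure limit of $(M_k)_{k \in \mathbb{N}}$, whose existence is guaranteed by the de Finetti Theorem. Let $\varepsilon > 0$. Then there exists $\eta_1 > 0$ depending only on $K$ and $\varepsilon$, such that, for $l < n \in \mathbb{N} \cup \{\infty\}$,

$$\mathbb{P}\{|M_n - M_l| \ge \varepsilon\} \le 2 e^{-\eta_1(K, \varepsilon)\, l}. \tag{3.5}$$

b) Let $(e_i(t))_{t \in [0,1]}$, $i \in \mathbb{N}$, be centered martingales such that $\max_{i \in \mathbb{N}} \sup_{t \in [0,1]} |e_i(t)| \le K$ almost surely and $(e_1(1), e_2(1), \dots)$ is exchangeable. Put

$$M_k(t) := \frac{1}{k} \sum_{i=1}^{k} e_i(t).$$

Let $\varepsilon > 0$. Then there exists $\eta_2 > 0$ depending only on $K$ and $\varepsilon$, such that, for $l \in \mathbb{N}$,

$$\mathbb{P}\Big\{ \sup_{t \in [0,1]} |M_l(t)| \ge \varepsilon \Big\} \le 2 e^{-\eta_2(K, \varepsilon)\, l}. \tag{3.6}$$

Proof. The proof of part a) is a straightforward extension of that of Lemma A.2 from [DK99], which employs the fact that an infinite exchangeable sequence is conditionally i.i.d. together with standard arguments based on the moment generating function.

For part b) observe that by Doob's submartingale inequality,

$$\mathbb{P}\Big\{ \sup_{t \in [0,1]} |M_l(t)| \ge \varepsilon \Big\} \le \inf_{\lambda > 0} e^{-\lambda \varepsilon}\, \mathbb{E}\, e^{\lambda |M_l(1)|} \le \inf_{\lambda > 0} e^{-\lambda \varepsilon}\, \mathbb{E} \exp\Big( \frac{\lambda}{l} \Big| \sum_{i=1}^{l} e_i(1) \Big| \Big). \tag{3.7}$$

Now proceed as in part a). □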
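For orientation, the following worked estimate indicates where an exponential rate of the type in (3.5) can come from in the case $n = \infty$. It is a sketch that uses Hoeffding's inequality conditionally on the tail $\sigma$-field (denoted $\mathcal{T}$ here, a notation not used in the paper) and is not claimed to reproduce the constants of [DK99].

```latex
% Sketch for n = \infty. Given the tail \sigma-field \mathcal{T}, the e_i are
% i.i.d. with mean M_\infty and values in [-K, K], so Hoeffding's inequality
% applies conditionally:
\begin{align*}
\mathbb{P}\{ |M_l - M_\infty| \ge \varepsilon \}
  &= \mathbb{E}\big[ \mathbb{P}\{ |M_l - M_\infty| \ge \varepsilon \mid \mathcal{T} \} \big] \\
  &\le 2 \exp\!\Big( -\frac{2 l^2 \varepsilon^2}{l\,(2K)^2} \Big)
   = 2 \exp\!\Big( -\frac{\varepsilon^2}{2K^2}\, l \Big),
\end{align*}
% i.e. \eta_1(K, \varepsilon) = \varepsilon^2 / (2K^2) is one admissible choice.
```

The rate depends on $K$ and $\varepsilon$ only, which is the feature of Lemma 3.1 that is used repeatedly below.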

The following lemma provides the technical core of the argument and replaces Lemma 3.4 and Lemma 3.5 in [DK99]. The proof given below follows closely the arguments of Donnelly and Kurtz [DK99].

Lemma 3.2. In the setting of Theorem 1.1, for all $c, T, \varepsilon > 0$ and $f \in \mathcal{D}(B)$ (the domain of the mutation generator) there exists a sequence $\delta_l$ such that $\sum_{l=1}^{\infty} \delta_l < \infty$ and

$$\mathbb{P}\Big\{ \sup_{0 \le t \le T} \big| \langle f, Z(t) \rangle - \langle f, Z^l(t) \rangle \big| > 11 \varepsilon,\ U(T) < c \Big\} \le \delta_l. \tag{3.8}$$

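Proof. By Lemma 3.1 and the exchangeability properties of $X$, we have

$$\mathbb{P}\big\{ |\langle f, Z(\alpha) \rangle - \langle f, Z^l(\alpha) \rangle| > \varepsilon \big\} \le 2 e^{-\eta l} \tag{3.9}$$

if $\alpha$ is a stopping time with respect to $\tilde{\mathcal{H}} := (\tilde{\mathcal{H}}_t)_{t \ge 0} := \big(\sigma(U(s) : s \ge 0) \vee \sigma(Z(s) : 0 \le s \le t)\big)_{t \ge 0}$ (observe that $\tilde{\mathcal{H}}_t \subset \mathcal{H}_t$, where $\mathcal{H}_t$ is defined in (2.25)).

Now fix $l$ and $\varepsilon$. Define the $\tilde{\mathcal{H}}$-stopping times

$$\alpha_1 := \inf\{t : U(t) > l^{-4}\} \wedge l^{-4} \tag{3.10}$$

and

$$\alpha_{o+1} := \inf\{t : U(t) > U(\alpha_o) + l^{-4}\} \wedge (\alpha_o + l^{-4}), \qquad o = 1, 2, \dots, \tag{3.11}$$

which yield a decomposition of the interval $[0, T]$. Note that on the event $\{U(T) < c\}$ there exist at most

$$o_l := 2(c + T)\, l^4 \tag{3.12}$$

such $\alpha_o$, i.e., we have

$$\mathbb{P}\{\alpha_{o_l} < T,\ U(\alpha_{o_l}) < c,\ U(T) < c\} = 0. \tag{3.13}$$

We define a second kind of $\tilde{\mathcal{H}}$-stopping times depending on $\alpha_o$ via

$$\tilde\alpha_o := \inf\{t > \alpha_o : |\langle f, Z(t) \rangle - \langle f, Z(\alpha_o) \rangle| > 6 \varepsilon\}. \tag{3.14}$$

We see from (3.9) that

$$H_o := |\langle f, Z(\alpha_o) \rangle - \langle f, Z^l(\alpha_o) \rangle| \vee |\langle f, Z(\tilde\alpha_o) \rangle - \langle f, Z^l(\tilde\alpha_o) \rangle| \tag{3.15}$$

satisfies

$$\mathbb{P}\Big\{ \sup_{o \le o_l} H_o > \varepsilon,\ U(T) < c \Big\} \le \sum_{o=1}^{o_l} \mathbb{P}\{H_o > \varepsilon,\ U(T) < c\} \le 8 (c + T)\, l^4 e^{-\eta l}. \tag{3.16}$$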

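It remains to estimate the variation of $Z^l$ and $Z$ in between the stopping times $\alpha_o$. For $u \in [\alpha_o, \alpha_{o+1})$ let $\beta_{jo}(u)$ denote the smallest index of a descendant of $X_j(\alpha_o)$, and let the stopping time $\gamma_{jo}$ be the time when the smallest descendant of $X_j(\alpha_o)$ is shifted above the level $l$. Put

$$\tilde X_j(u) := \begin{cases} X_{\beta_{jo}(u)}(u) & \text{if } u < \gamma_{jo}, \\ X_{\beta_{jo}(\gamma_{jo}-)}(\gamma_{jo}-) & \text{if } u \ge \gamma_{jo}. \end{cases}$$

Observe that

$$\langle f, Z^l(u) \rangle - \langle f, Z^l(\alpha_o) \rangle = \langle f, Z^l(u) \rangle - \frac{1}{l} \sum_{j=1}^{l} f(\tilde X_j(u)) + \frac{1}{l} \sum_{j=1}^{l} \big( f(\tilde X_j(u)) - f(\tilde X_j(\alpha_o)) \big). \tag{3.17}$$

It will be useful to treat the two parts of the sum separately. Define

$$K_1 := \max_{o < o_l} \sup_{u \in [\alpha_o, \alpha_{o+1})} \Big| \langle f, Z^l(u) \rangle - \frac{1}{l} \sum_{j=1}^{l} f(\tilde X_j(u)) \Big|$$

and

$$K_2 := \max_{o < o_l} \sup_{u \in [\alpha_o, \alpha_{o+1})} \Big| \frac{1}{l} \sum_{j=1}^{l} \big( f(\tilde X_j(u)) - f(\tilde X_j(\alpha_o)) \big) \Big|.$$

Note that the law of $K_2$ depends only on the mutation mechanism, since $\tilde X_j(u)$ follows the line of the individual $\tilde X_j(\alpha_o) = X_j(\alpha_o)$ and thus only evolves independently according to a mutation process with generator $B$.

Begin with $K_1$ and note that, for $u \in [\alpha_o, \alpha_{o+1})$,

$$\Big| \langle f, Z^l(u) \rangle - \frac{1}{l} \sum_{j=1}^{l} f(\tilde X_j(u)) \Big| = \frac{1}{l} \Big| \sum_{j=1}^{l} f(X_j(u)) - \sum_{j=1}^{l} f(\tilde X_j(u)) \Big| \le \frac{2 \|f\|}{l}\, N^l[\alpha_o, \alpha_{o+1}), \tag{3.18}$$

where $N^l[\alpha_o, \alpha_{o+1})$ is the total number of births occurring in the time interval $[\alpha_o, \alpha_{o+1})$ with index less than or equal to $l$. To see this note that at time $\alpha_o$ the two sums in the second expression cancel. A birth event in the interval $[\alpha_o, \alpha_{o+1})$ means that one type is removed from the second sum and another one is added, thus the expression can be altered by up to $2 \|f\| / l$.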

There are two mechanisms which can increase $N^l[\alpha_o, \alpha_{o+1})$. It can either increase during a large birth event given by a "jump" of $\mathfrak{M}^{\Xi_0}$ or during a small birth event which is given by one of the "Kingman-related" Poisson processes $\mathfrak{N}^{\mathrm{K}}_{ij}$.

We first consider large birth events. Let $\{v_m^2\}$ be the jumps of $U$ in the interval $[\alpha_o, \alpha_{o+1})$, and condition on this configuration for the rest of this paragraph. At the time of the $m$-th jump, a Binomial$(l, v_m)$-distributed number of levels $\le l$ participates in this event, hence $k_m$, the total number of children below level $l$ in the $m$-th birth event, satisfies

$$k_m \le (b_m - 1)^+,$$

where $b_m$ is Binomial$(l, v_m)$-distributed. Note that we can subtract 1 from the binomial random variable, since at least one of the levels participating in the birth event must be a mother. This subtraction will be crucial later on.
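One way to read the remark about the subtraction is through first moments: $\mathbb{E}\, b_m = l v_m$ is linear in $v_m$, whereas $\mathbb{E}(b_m - 1)^+ = l v_m - 1 + (1 - v_m)^l \le \binom{l}{2} v_m^2$ is quadratic, and the $v_m^2$ are exactly the jumps of $U$. The sketch below checks this identity and bound numerically; the function name `expected_excess` and the values of $l$ and $v$ are hypothetical, and the paper's own estimate (3.19) works with fourth moments rather than with this first-moment bound.

```python
from math import comb

def expected_excess(l, v):
    """E[(b - 1)^+] for b ~ Binomial(l, v), summed directly from the pmf."""
    return sum((k - 1) * comb(l, k) * v**k * (1 - v)**(l - k)
               for k in range(2, l + 1))

l, v = 50, 0.01
exact = expected_excess(l, v)
closed_form = l * v - 1 + (1 - v)**l      # E[b] - P{b >= 1}
quadratic_bound = l * (l - 1) * v**2 / 2  # uses (1-v)^l <= 1 - l v + C(l,2) v^2
mean_without_subtraction = l * v          # linear in v, for comparison
```

For small $v$ the quadratic bound is much smaller than $l v$, so summing over the jumps in an interval where $U$ increases by at most a prescribed amount gives a usable estimate only after the subtraction.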

By elementary calculations with Binomial distributions, involving fourth moments, similar to [DK99, p. 186], we can estimate

*™ > ei} < P{ - 1)+ > el} < ^ (3.19) 



for some $0 < C_1 < \infty$.

As we mentioned before, $N^l[\alpha_o, \alpha_{o+1})$ and thus $K_1$ can also be increased by the Kingman part of the birth process, but only if the parental particle and its offspring are placed below level $l$. The number of times this happens in the interval $[\alpha_o, \alpha_{o+1})$ is stochastically dominated by a Poisson distributed random variable $R$ with parameter $\binom{l}{2} l^{-4}$, since the length of the interval is bounded by $l^{-4}$. So, the probability that $\frac{2 \|f\|}{l} N^l[\alpha_o, \alpha_{o+1})$ exceeds $2 \varepsilon$ due to this mechanism is bounded by the probability that $R$ exceeds $\varepsilon l / \|f\|$. By elementary estimates on the tails of Poisson random variables, we have

$$\mathbb{P}\Big\{ R \ge \frac{\varepsilon l}{\|f\|} \Big\} \le e^{-\kappa l} \tag{3.20}$$

for some $\kappa > 0$ and $l$ large enough.

Combining (3.19) and (3.20), we obtain

1 ' 

P{i^i > 2e, [/(T) < c} = P| max sup |(/, Z'(n)) - - V /(X,(n))| > 2e, [/(T) < c| 



'^1 . --rill 



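for $l$ large enough. This controls the increments of $\langle f, Z^l \rangle$ in the intervals $[\alpha_o, \alpha_{o+1}]$.

We now consider $K_2$. Observe that, for $u \in [\alpha_o, \alpha_{o+1})$,

$$\frac{1}{l} \sum_{j=1}^{l} \big( f(\tilde X_j(u)) - f(\tilde X_j(\alpha_o)) \big) = \tilde M_o^l(u) + \frac{1}{l} \sum_{j=1}^{l} \int_{\alpha_o}^{u} Bf(\tilde X_j(s))\, ds, \tag{3.22}$$

and that, for $u \ge \alpha_o$ and each $o$,

$$\tilde M_o^l(u \wedge \alpha_{o+1}) := \frac{1}{l} \sum_{j=1}^{l} \Big( f(\tilde X_j(u \wedge \alpha_{o+1})) - f(\tilde X_j(\alpha_o)) - \int_{\alpha_o}^{u \wedge \alpha_{o+1}} Bf(\tilde X_j(s))\, ds \Big) \tag{3.23}$$

is a martingale. For $l$ so large that $l^{-4} \|Bf\| < \varepsilon$, we have

$$\begin{aligned}
\mathbb{P}\{K_2 > 2\varepsilon,\ U(T) < c\}
&\le \sum_{o=0}^{o_l - 1} \mathbb{P}\Big\{ \sup_{\alpha_o \le u < \alpha_{o+1}} \Big| \tilde M_o^l(u) + \frac{1}{l} \sum_{j=1}^{l} \int_{\alpha_o}^{u} Bf(\tilde X_j(s))\, ds \Big| > 2\varepsilon,\ U(T) < c \Big\} \\
&\le \sum_{o=0}^{o_l - 1} \mathbb{P}\Big\{ \sup_{\alpha_o \le u < \alpha_{o+1}} |\tilde M_o^l(u)| + l^{-4} \|Bf\| > 2\varepsilon,\ U(T) < c \Big\} \\
&\le \sum_{o=0}^{o_l - 1} \mathbb{P}\Big\{ \sup_{\alpha_o \le u < \alpha_{o+1}} |\tilde M_o^l(u)| > \varepsilon,\ U(T) < c \Big\}.
\end{aligned} \tag{3.24}$$

We now need to bound each summand. Using the notation

$$\tilde M_o^l(u) = \frac{1}{l} \sum_{j=1}^{l} e_j(u),$$

where

$$e_j(u) := f(\tilde X_j(\alpha_{o+1} \wedge u)) - f(\tilde X_j(\alpha_o)) - \int_{\alpha_o}^{\alpha_{o+1} \wedge u} Bf(\tilde X_j(s))\, ds, \qquad u \in [0,1], \tag{3.25}$$

each $(e_j(u))_u$ is a martingale with $\mathbb{E}\, e_j(u) = 0$ and $|e_j(u)| \le 2 \|f\| + \|Bf\| =: K$ almost surely.

Moreover, the $e_j(u)$ are exchangeable. We obtain from Lemma 3.1

$$\mathbb{P}\Big\{ \sup_{\alpha_o \le u < \alpha_{o+1}} |\tilde M_o^l(u)| > \varepsilon \Big\} \le 2 e^{-\eta_2 l}, \tag{3.26}$$

for some $\eta_2 > 0$.

Combining this result with (3.24), we arrive at

$$\mathbb{P}\{K_2 > 2\varepsilon,\ U(T) < c\} \le o_l\, C_2\, e^{-\eta_2 l}. \tag{3.27}$$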

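Now observe that if $\max_{o \le o_l} H_o < \varepsilon$, $K_1 < 2\varepsilon$ and $K_2 < 2\varepsilon$, then $\tilde\alpha_o > \alpha_{o+1}$. This can easily be seen by contradiction. Indeed, if we assume that $\tilde\alpha_o < \alpha_{o+1}$, this would imply

$$|\langle f, Z(\alpha_o) \rangle - \langle f, Z(\tilde\alpha_o) \rangle| > 6\varepsilon, \tag{3.28}$$

according to (3.14). But on the other hand we know that

$$|\langle f, Z(\alpha_o) \rangle - \langle f, Z^l(\alpha_o) \rangle| < \varepsilon \quad \text{and} \quad |\langle f, Z(\tilde\alpha_o) \rangle - \langle f, Z^l(\tilde\alpha_o) \rangle| < \varepsilon \qquad \forall o \tag{3.29}$$

due to our bound on $H_o$. Since the distance between $\langle f, Z \rangle$ and $\langle f, Z^l \rangle$ was at most $\varepsilon$ at the beginning of the interval and $\langle f, Z^l \rangle$ can only have moved by at most $4\varepsilon$ on the event $\{K_1 < 2\varepsilon\} \cap \{K_2 < 2\varepsilon\} \cap \{\max_{o \le o_l} H_o < \varepsilon\}$,

$$|\langle f, Z(\alpha_o) \rangle - \langle f, Z^l(\tilde\alpha_o) \rangle| < 5\varepsilon \tag{3.30}$$

must hold if $\tilde\alpha_o < \alpha_{o+1}$. But equation (3.28) states that $\langle f, Z(\tilde\alpha_o) \rangle$ is more than $6\varepsilon$ away from its starting point, so this contradicts that it can only be $\varepsilon$ away from $\langle f, Z^l(\tilde\alpha_o) \rangle$, which is ensured by our condition on $H_o$. Thus $\tilde\alpha_o$ has to be greater than $\alpha_{o+1}$, which in turn implies that

$$\sup_{\alpha_o \le u < \alpha_{o+1}} |\langle f, Z(u) \rangle - \langle f, Z(\alpha_o) \rangle| < 6\varepsilon \tag{3.31}$$

holds on the event $\{K_1 < 2\varepsilon\} \cap \{K_2 < 2\varepsilon\} \cap \{\max_{o \le o_l} H_o < \varepsilon\}$.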

Putting observations (3.16) and (3.31), the bound (3.27) and the bound (3.21) together, we finally obtain

$$\mathbb{P}\Big\{ \sup_{0 \le t \le T} |\langle f, Z(t) \rangle - \langle f, Z^l(t) \rangle| > 11\varepsilon,\ U(T) < c \Big\} \le \delta_l \tag{3.32}$$

with

5i := 8(c + T)l^e-'^^ + oiCir'^ + oie-''^^ + o^Cze-''^^ (3.33) 

which is the statement of the lemma, since due to equation (3.12) $o_l \sim l^4$ holds and therefore the $\delta_l$ are summable. □

Proof of Theorem 1.1. Almost sure convergence of $Z^l$ to $Z$ with respect to the metric (1.8) follows directly from Lemma 3.2 and the Borel-Cantelli Lemma, completing the proof of Theorem 1.1. □



4 The Hille-Yosida approach

In this section we provide two alternative representations of the $\Xi_0$-Fleming-Viot generator, leading to the distributional duality to the $\Xi$-coalescent, and we show that they generate a Markov semigroup on $\mathcal{M}_1(E)$, hence leading to a classical construction of the $\Xi_0$-Fleming-Viot process as a Markov process.

4.1 Two representations of the $\Xi_0$-Fleming-Viot generator

Recall that if the type space $E$ is a compact Polish space (which is assumed in this paper), then the set $\mathcal{M}_1(E)$ of all probability measures on $E$, equipped with the weak topology, is again a Polish space. We briefly recall the notation from Section 1. For $f : E^n \to \mathbb{R}$ bounded and measurable consider the test function

$$G_f(\mu) := \int_{E^n} f(x_1, \dots, x_n)\, \mu^{\otimes n}(dx_1, \dots, dx_n), \qquad \mu \in \mathcal{M}_1(E). \tag{4.1}$$

The linear operator $L^{\Xi_0}$ was defined via

$$L^{\Xi_0} G_f(\mu) = \int_{\Delta} \int_{E^{\mathbb{N}}} \Big[ G_f\Big( (1 - |\zeta|)\mu + \sum_{i=1}^{\infty} \zeta_i \delta_{x_i} \Big) - G_f(\mu) \Big]\, \mu^{\otimes \mathbb{N}}(dx)\, \frac{\Xi_0(d\zeta)}{(\zeta, \zeta)}. \tag{4.2}$$

This operator is the $\Xi_0$-Fleming-Viot generator from Proposition 1.3. The following representation will be useful to establish the duality with the $\Xi_0$-coalescent. Note that if $\Xi$ is concentrated on $\{\zeta \in \Delta : \zeta_i = 0 \text{ for all } i \ge 2\}$, i.e., if the corresponding coalescent is a $\Lambda$-coalescent, then this result has already been obtained by Bertoin and Le Gall [BLG03, Eqs. (16) and (17)].

For convenience, we will denote the transition rates by

$$\lambda(k_1, \dots, k_p) = \lambda_{b; k_1, \dots, k_r; s}, \tag{4.3}$$

where $k_1 \ge \dots \ge k_r \ge 2$, $p - r = s$ and $k_{r+1} = \dots = k_p = 1$. Furthermore, define for $p, n_1, \dots, n_p \in \mathbb{N}$ such that $n_1 + \dots + n_p > p$ ($\Leftrightarrow$ not all $n_i = 1$)

$$\lambda(n_1, \dots, n_p) := \lambda(k_1, \dots, k_p), \tag{4.4}$$

where $k_1 \ge \dots \ge k_p$ is the re-arrangement of $n_1, \dots, n_p$ in decreasing order.
Lemma 4.1. The operator $L^{\Xi_0}$ has the alternative representation

$$L^{\Xi_0} G_f(\mu) = \sum_{\substack{\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n \\ \text{not all singletons}}} \lambda(|A_1|, \dots, |A_p|) \int_{E^n} \big( f(x[\pi]) - f(x) \big)\, \mu^{\otimes n}(dx_1, \dots, dx_n), \tag{4.5}$$

where $x[\{A_1, \dots, A_p\}] \in E^n$ has entries

$$(x[\{A_1, \dots, A_p\}])_i := x_{\min A_j} \quad \text{if } i \in A_j, \qquad i = 1, \dots, n.$$

Remark 4.2. Note that (4.5) basically boils down to (1.4), if $|A_i| = 1$ for all but one $A_i$.
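The operation $x \mapsto x[\pi]$ used in Lemma 4.1 is easy to state in code. In this minimal sketch the function name `merge_coordinates` is hypothetical, and blocks are given as sets of 1-based indices as in the text.

```python
def merge_coordinates(x, partition):
    """Return x[pi]: coordinate i becomes x_{min A_j} for the block A_j containing i."""
    out = list(x)
    for block in partition:
        rep = min(block)
        for i in block:
            out[i - 1] = x[rep - 1]   # blocks use 1-based indices as in the text
    return tuple(out)

x = ("a", "b", "c", "d")
merged = merge_coordinates(x, [{1, 3}, {2}, {4}])        # third entry copies the first
unchanged = merge_coordinates(x, [{1}, {2}, {3}, {4}])   # all singletons: x itself
```

The second call illustrates why the partition into singletons does not contribute to the sum: there $x[\pi] = x$ and the integrand vanishes.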
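Proof of Lemma 4.1. First note that for fixed $\zeta$ and $x$,

$$G_f\Big( (1 - |\zeta|)\mu + \sum_{i=1}^{\infty} \zeta_i \delta_{x_i} \Big) = \sum_{\phi : \{1, \dots, n\} \to \mathbb{Z}_+} (1 - |\zeta|)^{a(\phi)} \prod_{j \le n : \phi(j) > 0} \zeta_{\phi(j)} \int_{E^{a(\phi)}} f(\eta(\phi, x, y))\, \mu^{\otimes a(\phi)}(dy_1, \dots, dy_{a(\phi)}), \tag{4.6}$$

where $a(\phi) := \#\{1 \le j \le n : \phi(j) = 0\}$ and $\eta(\phi, x, y) \in E^n$ is given by

$$\eta(\phi, x, y)_j := \begin{cases} x_{\phi(j)} & \text{if } \phi(j) > 0, \\ y_k & \text{if } \phi(j) = 0, \text{ where } k = \#\{1 \le j' \le j : \phi(j') = 0\}. \end{cases}$$

Identity (4.6) can be understood as follows: Expanding the $n$-fold product of $(1 - |\zeta|)\mu + \sum_{i=1}^{\infty} \zeta_i \delta_{x_i}$, we put $\phi(j) = 0$ if in the $j$-th factor we use $(1 - |\zeta|)\mu$, and we put $\phi(j) = i$ if we use $\zeta_i \delta_{x_i}$ in the $j$-th factor.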
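The expansion behind (4.6) can be checked numerically on a small example. The sketch below assumes a finite type space $E = \{0, \dots, d-1\}$, a mass partition $\zeta$ with finitely many non-zero entries and fixed atoms $x_i$; all names and numerical values are hypothetical. It compares the direct evaluation of $G_f$ at the mixed measure with the sum over all maps $\phi$.

```python
from itertools import product
from math import prod

def G_direct(f, nu, n):
    """G_f(nu) for a probability vector nu on E = {0, ..., len(nu) - 1}."""
    return sum(f(*pt) * prod(nu[e] for e in pt)
               for pt in product(range(len(nu)), repeat=n))

def G_expanded(f, mu, zeta, x, n):
    """Sum over phi: {1..n} -> {0, 1, ..., len(zeta)} as in the expansion of the n-fold product."""
    rest = 1.0 - sum(zeta)
    total = 0.0
    for phi in product(range(len(zeta) + 1), repeat=n):
        free = [j for j in range(n) if phi[j] == 0]   # factors using (1 - |zeta|) mu
        weight = rest ** len(free) * prod(zeta[i - 1] for i in phi if i > 0)
        for y in product(range(len(mu)), repeat=len(free)):
            point = [x[i - 1] if i > 0 else None for i in phi]
            for j, yj in zip(free, y):
                point[j] = yj                          # fill the free slots with y
            total += weight * prod(mu[e] for e in y) * f(*point)
    return total

mu = [0.3, 0.7]        # hypothetical measure on E = {0, 1}
zeta = [0.5, 0.25]     # hypothetical mass partition
x = [1, 0]             # hypothetical atoms x_1, x_2
nu = [(1 - sum(zeta)) * mu[e] + sum(z for z, xi in zip(zeta, x) if xi == e)
      for e in range(len(mu))]
same_type = lambda a, b: 1.0 if a == b else 0.0
lhs = G_direct(same_type, nu, n=2)
rhs = G_expanded(same_type, mu, zeta, x, n=2)
```

Here `lhs` and `rhs` agree up to rounding, as the identity predicts; the check is of course no substitute for the argument in the text, which also covers infinitely many non-zero $\zeta_i$.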

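Each $\phi : \{1, \dots, n\} \to \mathbb{Z}_+$ is uniquely described by a partition $\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n$ with labels $r_1, \dots, r_p \in \mathbb{Z}_+$ by defining $j \sim_\phi j'$ if and only if $\phi(j) = \phi(j') > 0$ and putting $r_i := \phi(A_i)$, $i = 1, \dots, p$. Note that for a given partition $\{A_1, \dots, A_p\}$, any vector $(r_1, \dots, r_p) \in \mathbb{Z}_+^p$ of labels with the properties

$$r_i = 0 \ \Rightarrow\ |A_i| = 1 \qquad \text{and} \qquad i \neq j,\ r_j \neq 0 \ \Rightarrow\ r_i \neq r_j$$

is admissible. Thus we have

$$\int_{E^{\mathbb{N}}} G_f\Big( (1 - |\zeta|)\mu + \sum_{i=1}^{\infty} \zeta_i \delta_{x_i} \Big)\, \mu^{\otimes \mathbb{N}}(dx) = \sum_{\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n} \; \sum_{\substack{(r_1, \dots, r_p) \\ \text{admissible}}} (1 - |\zeta|)^{\#\{i : r_i = 0\}} \prod_{i : r_i > 0} \zeta_{r_i}^{|A_i|} \int_{E^n} f(x[\pi])\, \mu^{\otimes n}(dx). \tag{4.7}$$

Note that, for a given partition with $p$ blocks, the integration appearing in the last line runs effectively only over $E^p$. For further simplification assume that the blocks $A_1, \dots, A_p$ of $\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n$ are enumerated according to decreasing block size, and write $s(\pi)$ for the number of singleton blocks of the partition $\pi = \{A_1, \dots, A_p\}$. Then, for a given $\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n$, the last sum in (4.7) can be written as

$$\sum_{l=0}^{s(\pi)} \binom{s(\pi)}{l} (1 - |\zeta|)^{s(\pi) - l} \sum_{\substack{i_1, \dots, i_{p - s(\pi) + l} \in \mathbb{N} \\ \text{all distinct}}} \zeta_{i_1}^{|A_1|} \cdots \zeta_{i_{p - s(\pi) + l}}^{|A_{p - s(\pi) + l}|} \int_{E^n} f(x[\pi])\, \mu^{\otimes n}(dx).$$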
all distinct 
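Furthermore, for any $\zeta \in \Delta$ and $n \in \mathbb{N}$,

$$1 = \Big( (1 - |\zeta|) + \sum_{i=1}^{\infty} \zeta_i \Big)^n = \sum_{\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n} \; \sum_{l=0}^{s(\pi)} \binom{s(\pi)}{l} (1 - |\zeta|)^{s(\pi) - l} \sum_{\substack{i_1, \dots, i_{p - s(\pi) + l} \in \mathbb{N} \\ \text{all distinct}}} \zeta_{i_1}^{|A_1|} \cdots \zeta_{i_{p - s(\pi) + l}}^{|A_{p - s(\pi) + l}|}.$$

This allows us to re-express the inner integral in (4.2) as

$$\sum_{\substack{\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n \\ \text{not all singletons}}} \; \sum_{l=0}^{s(\pi)} \binom{s(\pi)}{l} (1 - |\zeta|)^{s(\pi) - l} \sum_{\substack{i_1, \dots, i_{p - s(\pi) + l} \in \mathbb{N} \\ \text{all distinct}}} \zeta_{i_1}^{|A_1|} \cdots \zeta_{i_{p - s(\pi) + l}}^{|A_{p - s(\pi) + l}|} \times \int_{E^n} \big[ f(x[\pi]) - f(x) \big]\, \mu^{\otimes n}(dx),$$

because $x[\{\{1\}, \dots, \{n\}\}] = x$. Integrating this equation over $\Delta$ with respect to the measure $(\zeta, \zeta)^{-1} \Xi_0$ yields (4.5). Note that (see also [HMl p. 844])

$$\begin{aligned}
&\sum_{\substack{\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n \\ \text{not all singletons}}} \; \sum_{l=0}^{s(\pi)} \binom{s(\pi)}{l} (1 - |\zeta|)^{s(\pi) - l} \sum_{\substack{i_1, \dots, i_{p - s(\pi) + l} \in \mathbb{N} \\ \text{all distinct}}} \zeta_{i_1}^{|A_1|} \cdots \zeta_{i_{p - s(\pi) + l}}^{|A_{p - s(\pi) + l}|} \\
&\qquad \le \sum_{\substack{\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n \\ \text{not all singletons}}} \Big( \sum_{i_1 = 1}^{\infty} \zeta_{i_1}^2 \Big) \sum_{l=0}^{s(\pi)} \binom{s(\pi)}{l} (1 - |\zeta|)^{s(\pi) - l} \sum_{i_{p - s(\pi) + 1}, \dots, i_{p - s(\pi) + l} \in \mathbb{N}} \zeta_{i_{p - s(\pi) + 1}} \cdots \zeta_{i_{p - s(\pi) + l}} \\
&\qquad = \sum_{\substack{\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n \\ \text{not all singletons}}} (\zeta, \zeta) \sum_{l=0}^{s(\pi)} \binom{s(\pi)}{l} (1 - |\zeta|)^{s(\pi) - l} |\zeta|^l = (|\mathcal{P}_n| - 1)\, (\zeta, \zeta)
\end{aligned}$$

to verify that there is no singularity near $\zeta = 0$. □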

4.2 Construction of the Markov semigroup and proof of Proposition 1.3

The following proposition ensures that there exists a Markov process attached to the $\Xi_0$-Fleming-Viot generator.

Proposition 4.3. The closure of $\{(G_f, L^{\Xi_0} G_f) : n \in \mathbb{N},\ f : E^n \to \mathbb{R} \text{ bounded and measurable}\}$ generates a Markov semigroup on $\mathcal{M}_1(E)$.

Proof. We write $G$ instead of $G_f$ for convenience. By the Hille-Yosida theorem (see, for example, [EK86, p. 165, Theorem 2.2]) it is sufficient to verify that

(i) the domain $D$ is dense in $C(\mathcal{M}_1(E))$,

(ii) the operator $L^{\Xi_0}$ satisfies the positive maximum principle, i.e., $L^{\Xi_0} G(\mu) \le 0$ for all $G \in D$, $\mu \in \mathcal{M}_1(E)$ with $\sup_{\nu \in \mathcal{M}_1(E)} G(\nu) = G(\mu) \ge 0$, and that
(iii) the range of $\lambda - L^{\Xi_0}$ is dense in $C(\mathcal{M}_1(E))$ for some $\lambda > 0$.

In order to verify (i) and (iii) we mimic the proof of Proposition 3.5 in Chapter 1 of [EK86] and construct a suitable sequence $D_1, D_2, \dots$ of finite-dimensional subspaces of $C(\mathcal{M}_1(E))$ such that $D := \bigcup_{k \in \mathbb{N}} D_k$ is dense in $C(\mathcal{M}_1(E))$ and $L^{\Xi_0} : D_k \to D_k$ for all $k \in \mathbb{N}$ as follows. For $n \in \mathbb{N}$ and $f : E^n \to \mathbb{R}$ bounded and measurable let $D_f$ denote the set of all linear combinations of elements from the set

$$\Big\{ G : G(\mu) = \int f(x[\pi])\, \mu^{\otimes n}(dx),\ \pi \in \mathcal{P}_n \Big\}.$$

Since $|\mathcal{P}_n| < \infty$, it is easily seen that $D_f$ is a finite-dimensional subspace of $C(\mathcal{M}_1(E))$. From (4.5) it follows that $L^{\Xi_0} : D_f \to D_f$. For each $n \in \mathbb{N}$ let $\{g_{nm} : m \in \mathbb{N}\} \subset C(E^n)$ be dense, and let $\{f_k : k \in \mathbb{N}\}$ be an enumeration of $\{g_{nm} : n, m \in \mathbb{N}\}$. Then, $D_k := D_{f_k}$, $k \in \mathbb{N}$, has the desired properties. Note that $D := \bigcup_{k \in \mathbb{N}} D_k$ is dense in $C(\mathcal{M}_1(E))$ (Stone-Weierstrass), i.e. condition (i) holds.

We have $(\lambda - L^{\Xi_0})(D_k) = D_k$ for all $\lambda$ not belonging to the set of eigenvalues of $L^{\Xi_0}|_{D_k}$, i.e., for all but at most finitely many $\lambda > 0$. Thus, $(\lambda - L^{\Xi_0})(D) = (\lambda - L^{\Xi_0})\big(\bigcup_{k \in \mathbb{N}} D_k\big) = \bigcup_{k \in \mathbb{N}} D_k = D$ is dense in $C(\mathcal{M}_1(E))$ for all but at most countably many $\lambda > 0$. In particular, condition (iii) is satisfied.

Condition (ii) follows from the fact that the expression inside the integrals in (1.12) satisfies

$$G\Big( (1 - |\zeta|)\mu + \sum_{i=1}^{\infty} \zeta_i \delta_{x_i} \Big) - G(\mu) \le \sup_{\nu \in \mathcal{M}_1(E)} G(\nu) - G(\mu) = G(\mu) - G(\mu) = 0$$

for all $x = (x_1, x_2, \dots) \in E^{\mathbb{N}}$, $\zeta \in \Delta$, $G \in D$ and $\mu \in \mathcal{M}_1(E)$ with $\sup_{\nu \in \mathcal{M}_1(E)} G(\nu) = G(\mu)$.

Thus, the Hille-Yosida theorem ensures that the closure $\overline{L^{\Xi_0}}$ of $L^{\Xi_0}$ on $C(\mathcal{M}_1(E))$ is single-valued and generates a strongly continuous, positive, contraction semigroup $\{T_t\}_{t \ge 0}$ on $\mathcal{M}_1(E)$. Note that from (iii) it follows that $D$ is a core for $\overline{L^{\Xi_0}}$ ([EK86, p. 166]). The operator $L^{\Xi_0}$ maps constant functions to the zero function, i.e., is conservative. Thus, $\{T_t\}_{t \ge 0}$ is a Feller semigroup and corresponds to a Markov process with sample paths in $D_{\mathcal{M}_1(E)}([0, \infty))$. □

Remark 4.4. i) If the finite measure $\Xi$ on $\Delta$ allows for some mass $a := \Xi(\{0\})$ at zero, then $L^{\Xi_0}$ has to be replaced by $L^{\Xi} := L^{\Xi_0} + L^{a\delta_0}$, where $L^{\Xi_0}$ is defined as before and $L^{a\delta_0}$ is the generator of the classical Fleming-Viot process [FV79] given by (1.11). The existence of a Markov process $Z = (Z_t)_{t \ge 0}$ with generator $L^{\Xi}$ can be deduced as in the proof of Proposition 4.3 via the Hille-Yosida theorem.

ii) The construction of the Markov process attached to the 'full' generator $L$, including the Kingman component (1.11) and the mutation component (1.13), works via the standard Trotter approach.

iii) Note that $\int (L^{\Xi} G)\, d\delta_{\delta_x} = 0$, $x \in E$, where $\delta_\nu \in \mathcal{M}_1(\mathcal{M}_1(E))$ denotes the unit mass at $\nu \in \mathcal{M}_1(E)$. Thus, see [EK86, p. 239, Proposition 9.2], the states $\delta_x$, $x \in E$, are absorbing for the $\Xi$-Fleming-Viot process.

We now turn to the proof of Proposition 1.3. Indeed, we verify the following

Claim: The distribution of the measure valued Markov process with generator $L$, as defined in Remark 4.4 ii), coincides with the distribution of the $(\Xi, B)$-Fleming-Viot process, as defined in Theorem 1.1.

It suffices to verify the following lemma.

Lemma 4.5. The $(\Xi, B)$-Fleming-Viot process defined in Theorem 1.1 solves the martingale problem for the generator $L$ given in (1.10).
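To prepare this, let us concentrate on the case when there is no mutation and no Kingman component ($L = L^{\Xi_0}$). Fix $l$ and suppose we are at the $m$-th birth event. As in the previous section, let $\{\phi_m^1, \dots, \phi_m^{a_m}\}$ denote the assignments of the levels to one of the $a_m$ families. So $\phi_m^i \subset \{1, \dots, l\}$ and $\phi_m^i \cap \phi_m^j = \emptyset$ for $i \neq j$. Furthermore, we again denote by $\Phi_m := \bigcup_{i=1}^{a_m} \phi_m^i$ the individuals participating in the birth event. Note that this can be a strict subset of $\{1, \dots, l\}$, and $\{\phi_m^1, \dots, \phi_m^{a_m}\}$ holds all information about what is going on at the birth event. The function $g(\zeta, u)$ is defined as in (2.2). We introduce a Poisson process counting the number of times a specific birth event $\{\phi_m^1, \dots, \phi_m^{a_m}\}$ happens. With $(t_m, \zeta_m, u_m)$ denoting the points of the Poisson point process $\mathfrak{M}^{\Xi_0}$ we define

$$L^l_{\{\phi_m^1, \dots, \phi_m^{a_m}\}}(t) := \sum_{m : t_m \le t} \; \sum_{\substack{b_1, \dots, b_{a_m} \in \mathbb{N} \\ \text{all distinct}}} \; \prod_{i=1}^{a_m} \prod_{j \in \phi_m^i} \mathbf{1}_{\{g(\zeta_m, u_{mj}) = b_i\}} \prod_{j \in \{1, \dots, l\} \setminus \Phi_m} \mathbf{1}_{\{g(\zeta_m, u_{mj}) = \infty\}}. \tag{4.8}$$

To describe the effect of the birth event $\{\phi_m^1, \dots, \phi_m^{a_m}\}$ on the population vector $x \in E^l$ we introduce the function $T_{\{\phi_m^1, \dots, \phi_m^{a_m}\}}$ defined by

$$\big( T_{\{\phi_m^1, \dots, \phi_m^{a_m}\}}(x) \big)_k := \begin{cases} x_{\min \phi_m^i} & \text{if } k \in \phi_m^i \text{ for some } i, \\ x_{J_m(k)} & \text{if } k \notin \Phi_m, \end{cases} \tag{4.9}$$

for all $k \in \{1, \dots, l\}$, where $J_m$ is the function defined in (2.20) that holds the information on where the non-participating particles should look down to.

With this notation we can use equation (2.22) and the dependence between the $L^l_{J,k}$ and $L^l_J$ to show that

$$X^l(t) := X^l(0) + \sum_{\{\phi_m^1, \dots, \phi_m^{a_m}\} :\, \bigcup_i \phi_m^i \subset \{1, \dots, l\}} \int_0^t \Big( T_{\{\phi_m^1, \dots, \phi_m^{a_m}\}}(X^l(s-)) - X^l(s-) \Big)\, dL^l_{\{\phi_m^1, \dots, \phi_m^{a_m}\}}(s) \tag{4.10}$$

describes the evolution of the first $l$ levels $X^l \in E^l$, if we assume no mutation and no Kingman part. Note that for simplicity we use the notation $X^l = (X_1, \dots, X_l)$.

Since the $L^l_{\{\phi_m^1, \dots, \phi_m^{a_m}\}}(t)$ are Poisson processes derived from the Poisson point process $\mathfrak{M}^{\Xi_0}$, it is straightforward to verify that their rates are given by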

r({0L...,C-}) := E //^"^'•■■Ct--''a..---C.Jl-KI)('-l*l)=^ (4.11) 



all distinct 



A 



where + 1 = \4'm\ before and the sets are ordered, such that k}^ > • • • > — ^ ^'^'i 
kl^^ = ■ ■ ■ = k'^ = hold. Assume that at least A;^ > 1 holds, because otherwise T is the 
identity. Note that under this assumption the integral in ()4.1ip is finite (c.f. [SOO] or [S03] ). 
We now turn to the actual proof of the lemma. 

Proof of Lemma 4.5. We will prove the result for the generator $L^{\Xi_0}$. The full result can then
be obtained in analogy to the proof of Theorem 2.4 in [DK96].

Indeed, we have to show that for each function $G_f \in \mathcal{D}(L^{\Xi_0})$ of the form

$$
G_f(\mu) = \langle f, \mu^{\otimes l} \rangle \tag{4.12}
$$

for $\mu \in \mathcal{M}_1(E)$ and $f : E^l \to \mathbb{R}$ bounded and measurable,

$$
G_f(Z(t)) - G_f(Z(0)) - \int_0^t (L^{\Xi_0} G_f)(Z(s))\, ds \tag{4.13}
$$

is a martingale with respect to the natural filtration of the Poisson point process $\mathfrak{M}^{\Xi_0}$ given by

$$
(\mathcal{F}_t)_{t \ge 0} := \Big( \sigma\big( \mathfrak{M}^{\Xi_0} \big|_{[0,t] \times \Delta \times [0,1]^{\mathbb{N}}} \big) \Big)_{t \ge 0}.
$$

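Note that

$$
\begin{aligned}
&\mathbb{E}\big[ f(X_1(s), \dots, X_l(s)) \,\big|\, \mathcal{F}_t \big] &\qquad (4.14) \\
&\quad = \mathbb{E}\big[ \langle f, Z(s)^{\otimes l} \rangle \,\big|\, \mathcal{F}_t \big] &\qquad (4.15)
\end{aligned}
$$

holds for all $s, t \ge 0$, which will be crucial in the following steps.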

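We start by observing that, for $0 \le w < t$, the representation (4.10) leads to

$$
0 = \mathbb{E}\Big[ f(X^l(t)) - f(X^l(w)) - \sum_{\bigcup_i \phi^i_m \subset \{1,\dots,l\}} \int_w^t \Big( f\big(T_{\{\phi^1_m,\dots,\phi^{a_m}_m\}}(X^l(s))\big) - f(X^l(s)) \Big)\, r(\{\phi^1_m,\dots,\phi^{a_m}_m\})\, ds \,\Big|\, \mathcal{F}_w \Big], \tag{4.16}
$$

since this is a martingale.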

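Using the definition of the rates (4.11) and the fact that, due to the exchangeability of $X^l$,
the action of $T_{\{\phi^1_m,\dots,\phi^{a_m}_m\}}$ and the $[\pi]$ operation under the expectation is the same, we can now
rewrite the last term (without the subtraction of $f(X^l(s))$ from the integrand) as

$$
\begin{aligned}
&\mathbb{E}\Big[ \sum_{\bigcup_i \phi^i_m \subset \{1,\dots,l\}} \int_w^t f\big(T_{\{\phi^1_m,\dots,\phi^{a_m}_m\}}(X^l(s))\big)\, r(\{\phi^1_m,\dots,\phi^{a_m}_m\})\, ds \,\Big|\, \mathcal{F}_w \Big] \\
&\quad = \mathbb{E}\Big[ \sum_{\pi = \{A_1,\dots,A_p\} \in \mathcal{P}_l}\ \sum_{\substack{(r_1,\dots,r_p) \\ \text{admissible}}} \int_w^t \int_\Delta \prod_{i:\, r_i > 0} \zeta_{r_i}^{|A_i|}\, (1-|\zeta|)^{\#\{i:\, r_i = 0\}}\, \frac{\Xi_0(d\zeta)}{(\zeta,\zeta)}\, f\big((X^l(s))[\pi]\big)\, ds \,\Big|\, \mathcal{F}_w \Big] \\
&\quad = \mathbb{E}\Big[ \int_w^t \int_\Delta \int_{E^{\mathbb{N}}} G_f\Big( (1-|\zeta|) Z(s) + \sum_{i=1}^\infty \zeta_i \delta_{x_i} \Big)\, Z(s)^{\otimes \mathbb{N}}(dx)\, \frac{\Xi_0(d\zeta)}{(\zeta,\zeta)}\, ds \,\Big|\, \mathcal{F}_w \Big],
\end{aligned} \tag{4.17}
$$

since the sum over the configurations $\{\phi^1_m, \dots, \phi^{a_m}_m\}$ and distinct indices $i_1, \dots, i_{a_m}$ can
be rewritten as the sum over the partitions $\pi$ and the admissible vectors $(r_1, \dots, r_p)$. The last
equality holds due to equation (4.7).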

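Combining equation (4.16) with equation (4.17) we see that

$$
\begin{aligned}
0 &= \mathbb{E}\Big[ f(X^l(t)) - f(X^l(w)) - \int_w^t \int_\Delta \int_{E^{\mathbb{N}}} \Big( G_f\Big( (1-|\zeta|) Z(s) + \sum_{i=1}^\infty \zeta_i \delta_{x_i} \Big) - G_f(Z(s)) \Big)\, Z(s)^{\otimes \mathbb{N}}(dx)\, \frac{\Xi_0(d\zeta)}{(\zeta,\zeta)}\, ds \,\Big|\, \mathcal{F}_w \Big] \\
&= \mathbb{E}\Big[ \langle f, Z(t)^{\otimes l} \rangle - \langle f, Z(w)^{\otimes l} \rangle - \int_w^t (L^{\Xi_0} G_f)(Z(s))\, ds \,\Big|\, \mathcal{F}_w \Big] \\
&= \mathbb{E}\Big[ G_f(Z(t)) - G_f(Z(w)) - \int_w^t (L^{\Xi_0} G_f)(Z(s))\, ds \,\Big|\, \mathcal{F}_w \Big]
\end{aligned} \tag{4.18}
$$

holds, where we use (4.15) in the second equality. Thus, (4.13) is a martingale. □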
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5 Dualities 



5.1 Distributional duality versus pathwise duality 

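We first establish a distributional duality in the classical sense of [L85]. Indeed, (4.5) and results
about the classical Fleming-Viot process bring forth the following duality between a $\Xi$-coalescent
$\Pi = (\Pi_t)_{t \ge 0}$ and a $\Xi$-Fleming-Viot process $Z = (Z_t)_{t \ge 0}$.

Lemma 5.1. (Duality) For $n \in \mathbb{N}$, $f : E^n \to \mathbb{R}$ bounded and measurable, $\mu \in \mathcal{M}_1(E)$, $\pi \in \mathcal{P}_n$
and $t \ge 0$,

$$
\mathbb{E}^{\mu}\Big[ \int_{E^n} f(x[\pi])\, Z_t^{\otimes n}(dx) \Big] = \mathbb{E}^{\pi}\Big[ \int_{E^n} f\big(x[\Pi^{(n)}_t]\big)\, \mu^{\otimes n}(dx) \Big], \tag{5.1}
$$

where $\Pi^{(n)}_t$ is the restriction of $\Pi_t$ to $\mathcal{P}_n$.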

To obtain a pathwise duality, we use the driving Poisson processes of the modified lookdown
construction to construct realisation-wise a $\Xi$-coalescent embedded in the $\Xi$-Fleming-Viot process.

More explicitly, recall the Poisson processes $L^l_J$ and $L^l_{J,k}$ from equations (2.18) and (2.19) in
Section 2.3 and the Poisson process defined in Section 1.3. For each $t \ge 0$ and $l \in \mathbb{N}$, let
$N^l_t(s)$, $0 \le s \le t$, be the level at time $s$ of the ancestor of the individual at level $l$ at time $t$. In
terms of the $L^l_J$ and $L^l_{J,k}$, the process $N^l_t(\cdot)$ solves, for $0 \le s \le t$,


i<i<j<l 
KC{1,...,1} 

-E E [\Nl(.^+)-^^iK)n{NUu+)^K}dL'K,M, (5.2) 

kG^Kc{l,...,l} 

where $J_m(\cdot) = J_{m(u)}(\cdot)$ is defined by (2.20) and $m(u)$ is the index of the jump at time $u$. Fix
$0 < T$ and, for $t \le T$, define a partition $\Pi^T_t$ of $\mathbb{N}$ such that $k$ and $l$ are in the same block of $\Pi^T_t$
if and only if $N^k_T(T - t) = N^l_T(T - t)$. Thus, $k$ and $l$ are in the same block if and only if the
two levels $k$ and $l$ at time $T$ have the same ancestor at time $T - t$. Then ([DK99], Section 5),

the process $(\Pi^T_t)_{0 \le t \le T}$ is a $\Xi$-coalescent run for time $T$. (5.3)

Note that by employing a natural generalisation of the lookdown construction using driving
Poisson processes on $\mathbb{R}$ and e.g. using $T = 0$ above, one can use the same construction to find
a $\Xi$-coalescent with time set $\mathbb{R}_+$. We would like to emphasise that the lookdown construction
provides a realisation-wise coupling of the type distribution process $(Z_t)_{t \ge 0}$ and the coalescent
describing the genealogy of a sample, thus extending (5.1), which is merely a statement about
one-dimensional distributions.
5.2 The function-valued dual of the $(\Xi, B)$-Fleming-Viot process

The duality between the $\Xi$-Fleming-Viot process and the $\Xi$-coalescent established in Section 5.1
worked only on the genealogical level; the mutation was not taken into account. However, it is
possible to define a function-valued dual to the $(\Xi, B)$-Fleming-Viot process such that not only
the genealogical structure, but also the mutation is part of the duality. This kind of duality is
well known for the classical Fleming-Viot process, see, e.g., Etheridge [E00, Chapter 1.12].

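First note that, due to Lemma 4.1, we can rewrite the generator of the $(\Xi, B)$-Fleming-Viot
process given by equation (1.10) to obtain

$$
\begin{aligned}
L G_f(\mu) := {}& a \sum_{1 \le i < j \le n} \int_{E^n} \big( f(x_1, \dots, x_i, \dots, x_i, \dots, x_n) - f(x_1, \dots, x_i, \dots, x_j, \dots, x_n) \big)\, \mu^{\otimes n}(dx) \\
& + \sum_{\substack{\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n \\ \text{not all singletons}}} \lambda(|A_1|, \dots, |A_p|) \int_{E^n} \big( f(x[\pi]) - f(x) \big)\, \mu^{\otimes n}(dx) \\
& + r \sum_{i=1}^n \int_{E^n} B_i\big( f(x_1, \dots, x_n) \big)\, \mu^{\otimes n}(dx).
\end{aligned} \tag{5.4}
$$

We can now reinterpret the function $G_f(\mu)$ acting on measures as a function $G_\mu(f)$ acting on
the functions $C_b(E^n)$. This reinterpretation transfers the operator $L$ acting on $C(\mathcal{M}_1(E))$ to
an operator $L^*$ acting on $C_b(C_b(E^n))$. Let $C := \bigcup_{n=1}^\infty C_b(E^n)$. A $C$-valued Markov process
$(\rho_t)_{t \ge 0}$ solving the martingale problem for $L^*$ can then be constructed as follows: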

• If $\rho_t(x) \in C_b(E^n)$ and $n \ge 2$, then the process $(\rho_t)_{t \ge 0}$ jumps to $\rho_t(x[\pi])$ with rate
$\lambda(|A_1|, \dots, |A_p|) + a\, 1_{\{\exists!\, i:\, |A_i| = 2;\ \forall j \ne i:\, |A_j| = 1\}}$, for all $\pi = \{A_1, \dots, A_p\} \in \mathcal{P}_n$, where $|A_j| > 1$
for at least one $j$.

• If $\rho_t \in C_b(E)$, that is, it is a function of a single variable, then no further jumps occur.

• Between jumps the process evolves deterministically according to the "heat flow" generated
by the mutation operator (1.6), independently for each coordinate.

Note that this process is not literally a coalescent, but has coalescent-like features.

The duality relation between $\rho_t$ and $Z_t$ immediately follows from (5.4) and can be written
in integrated form as

$$
\mathbb{E}_{Z_0} \langle \rho_0, Z_t \rangle = \mathbb{E}_{\rho_0} \langle \rho_t, Z_0 \rangle. \tag{5.5}
$$

It can be used, for example, to show uniqueness of the martingale problem for $L$ via the existence
of $(\rho_t)_{t \ge 0}$ or to calculate the moments of the $(\Xi, B)$-Fleming-Viot process.

5.3 The dual of the block counting process 

In this section, we specialise to the case where the type space $E$ consists of two types only,
say $E = \{0, 1\}$. Define the real-valued process $Y = (Y_t)_{t \ge 0}$ via $Y_t := Z_t(\{1\})$, $t \ge 0$. Define
$g : \mathcal{M}_1(E) \to [0, 1]$ via $g(\mu) := \mu(\{1\})$. The generator $A$ of $Y$ is then given by $Af(x) =
(L(f \circ g))(\mu)$, $f \in C^2([0, 1])$, where $\mu$ depends on $x \in [0, 1]$ and can be chosen arbitrarily, as
long as $g(\mu) = x$. Thus,

$$
Af(x) = a\, \frac{x(1-x)}{2}\, f''(x) + \int_\Delta \int_{\{0,1\}^{\mathbb{N}}} \Big( f\Big( (1-|\zeta|) x + \sum_{i=1}^\infty \zeta_i y_i \Big) - f(x) \Big)\, (B(1, x))^{\otimes \mathbb{N}}(dy)\, \frac{\Xi_0(d\zeta)}{(\zeta,\zeta)}, \tag{5.6}
$$

$x \in [0, 1]$, $f \in C^2([0, 1])$, where $B(1, x)$ denotes the Bernoulli distribution with parameter $x$. For
$x \in [0, 1]$ let $V_1(x), V_2(x), \dots$ be a sequence of independent and identically $B(1, x)$-distributed
random variables. Then,

$$
Af(x) = a\, \frac{x(1-x)}{2}\, f''(x) + \int_\Delta \int_{[0,1]} \big( f((1-|\zeta|) x + y) - f(x) \big)\, Q(\zeta, x, dy)\, \frac{\Xi_0(d\zeta)}{(\zeta,\zeta)},
$$

where $Q(\zeta, x, \cdot)$ denotes the distribution of $\sum_{i=1}^\infty \zeta_i V_i(x)$. Hence the process $Y$ can be considered
as a Wright-Fisher diffusion with jumps. The situation where $\Xi$ is concentrated on $[0, 1] \times \{0\}^{\mathbb{N}}$,
i.e., when the underlying $\Xi$-coalescent is a $\Lambda$-coalescent, has been studied in [BLG05].

Note that $Af \equiv 0$ for $f(x) = x$, so $Y$ is a martingale. Furthermore, the boundary points
0 and 1 are obviously absorbing.

In analogy to Lemma 5.1 it follows that $Y$ is dual to the block counting process $D = (D_t)_{t \ge 0}$
of the $\Xi$-coalescent with respect to the duality function $H : [0, 1] \times \mathbb{N} \to \mathbb{R}$, $H(x, n) := x^n$ (see,
e.g., Liggett [L85]), i.e.,

$$
\mathbb{E}_y[Y_t^n] = \mathbb{E}^n[y^{D_t}], \qquad n \in \mathbb{N},\ y \in [0, 1],\ t \ge 0.
$$

Thus, the moments of the 'forward' variable $Y_t$ can be computed via the generating function of
the 'backward' variable $D_t$ and vice versa. Such and closely related moment duality relations
are well known from the literature [AH07, AS05, M99]. The duality can be used to relate the
accessibility of the boundaries of $Y$ and the existence of an entrance law for $D$ with $D_{0+} = \infty$.
Note that by the Markov property and the structure of the jump rates, we always have

$$
\mathbb{P}^\infty(D_t = 1 \text{ eventually}) \in \{0, 1\} \tag{5.7}
$$

and either $\mathbb{P}^\infty\big(\bigcap_{t \ge 0} \{D_t = \infty\}\big) = 1$ (if the probability in (5.7) equals 0) or $\lim_{t \to \infty} \mathbb{P}^\infty(D_t =
1) = 1$ (if the probability in (5.7) equals 1).

Proposition 5.2. $\lim_{t \to \infty} \mathbb{P}^\infty(D_t = 1) = 1$ if and only if $Y$, the dual of its block counting
process, hits the boundary $\{0, 1\}$ in finite time almost surely, starting from any $y \in (0, 1)$.

Proof. Fix $y \in (0, 1)$, $T > 0$. Construct $(Z_t)$ starting from $y\delta_1 + (1 - y)\delta_0$ and no mutations,
$Bf = 0$, (and hence $Y$ starting from $y$) by using the lookdown construction from Section 2.3. Let
$X_1(0), X_2(0), \dots$ be independent $B(1, y)$-distributed random variables which are independent
of the driving Poisson processes, and let $X_n(t)$, $t \ge 0$, $n \in \mathbb{N}$, be the solution of (2.22). Let

$$
D'_t := \big| \{ N^n_T(T - t) : n \in \mathbb{N} \} \big|,
$$

where $N^n_T(s)$ solves (5.2). By (5.3), the law of $(D'_t)_{0 \le t \le T}$ is that of the block counting process
of the (standard) $\Xi$-coalescent run for time $T$. Then by construction (as there is no mutation),

$$
X_n(T) = X_{N^n_T(0)}(0),
$$

implying

$$
\{D'_T = 1\} \subset \{Y_T \in \{0, 1\}\} \quad \text{and} \quad \{D'_T = \infty\} \subset \{0 < Y_T < 1\} \quad \text{almost surely},
$$

which easily yields the claim. □

This is related to the so-called 'coming down from infinity' property of the standard $\Xi$-coalescent
(i.e., the property that starting from $D_0 = \infty$, $D_t < \infty$ almost surely for all $t > 0$).
Recall ([S00], p. 39f) that a $\Xi$-coalescent may have infinitely many classes for a positive amount
of time and then suddenly jump to finitely many classes. This can occur if $\Xi$ has positive mass
on $\Delta_f := \{u = (u_1, u_2, \dots) \in \Delta : u_1 + \dots + u_n = 1 \text{ for some } n \in \mathbb{N}\}$. On the other hand [S00,
Lemma 31], if $\Xi(\Delta_f) = 0$, then the $\Xi$-coalescent either comes down from infinity immediately
or always has infinitely many classes. Combining this with Proposition 5.2 we obtain






Remark 5.3. Assume that $\Xi(\Delta_f) = 0$. Then the $\Xi$-coalescent comes down from infinity if and
only if the dual of its block counting process hits the boundary $\{0, 1\}$ in finite time almost surely.

In general, there seems to be no 'simple' criterion to check whether a $\Xi$-coalescent comes
down from infinity (see the discussion in Section 5.5 of [S00]). On the other hand, there also seems
to be no 'handy' criterion for accessibility of the boundary of a process with jumps (and
with values in $[0, 1]$), but at least Proposition 5.2 allows one to transfer any progress from one side
to the other and vice versa.

We conclude this section with a simple toy example for which most quantities of interest, in
particular the generator $A$, can be computed explicitly.

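Example 5.4. Fix $l \in \mathbb{N}$. If the measure $\Xi$ is concentrated on $\Delta_l := \{\zeta \in \Delta : \zeta_1 + \dots + \zeta_l = 1\}$,
then (5.6) reduces to

$$
Af(x) = \int_{\Delta_l} \sum_{y_1, \dots, y_l \in \{0,1\}} x^{y_1 + \dots + y_l} (1-x)^{l - (y_1 + \dots + y_l)} \Big( f\Big( \sum_{i=1}^l \zeta_i y_i \Big) - f(x) \Big)\, \frac{\Xi(d\zeta)}{(\zeta,\zeta)}.
$$

For example, assume that the measure $\Xi$ assigns its total mass $\Xi(\Delta) := 1/l$ to the single point
$(1/l, \dots, 1/l, 0, 0, \dots) \in \Delta_l$. Then,

$$
Af(x) = \int \big( f(y/l) - f(x) \big)\, B(l, x)(dy),
$$

where $B(l, x)$ denotes the binomial distribution with parameters $l$ and $x$. Note that the corresponding
$\Xi$-coalescent never undergoes more than $l$ multiple collisions at one time. The rates
(4.8) are

$$
\lambda(k_1, \dots, k_p) = \sum_{\substack{i_1, \dots, i_p \in \{1, \dots, l\} \\ \text{all distinct}}} l^{-k_1} \cdots l^{-k_p} = \frac{(l)_p}{l^n},
$$

where $(l)_p := l(l - 1) \cdots (l - p + 1)$ and $n := k_1 + \dots + k_p$. The block counting process $D$ has
rates

$$
g_{np} = \frac{n!}{p!} \sum_{\substack{k_1, \dots, k_p \in \mathbb{N} \\ k_1 + \dots + k_p = n}} \frac{\lambda(k_1, \dots, k_p)}{k_1! \cdots k_p!} = S(n, p)\, \frac{(l)_p}{l^n},
$$

where the $S(n, p)$ denote the Stirling numbers of the second kind. The total rates are $g_n =
\sum_{p=1}^{n-1} g_{np} = 1 - (l)_n / l^n$, $n \in \mathbb{N}$. Note that the corresponding $\Xi$-coalescent stays infinite for a
positive amount of time ('Case 2' on top of [S00, p. 39] with $\Xi_2 = 0$). The dual of its block
counting process hits the boundary in finite time. ■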
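The rates in Example 5.4 lend themselves to an exact numerical cross-check. The following Python sketch (all function names are illustrative and not taken from the paper) assumes the toy measure with mass $1/l$ at $(1/l, \dots, 1/l, 0, \dots)$, computes $g_{np} = S(n,p)(l)_p/l^n$ in rational arithmetic, and compares the generator $A$ applied to $f(x) = x^n$ with the block-counting side $\sum_{p<n} g_{np}(x^p - x^n)$, which is the generator-level form of the moment duality with $H(x,n) = x^n$.

```python
from fractions import Fraction
from functools import lru_cache
from math import comb


@lru_cache(maxsize=None)
def stirling2(n, p):
    """Stirling number of the second kind S(n, p), via the standard recurrence."""
    if n == p:
        return 1
    if p == 0 or p > n:
        return 0
    return p * stirling2(n - 1, p) + stirling2(n - 1, p - 1)


def falling(l, p):
    """Falling factorial (l)_p = l (l-1) ... (l-p+1)."""
    out = 1
    for i in range(p):
        out *= l - i
    return out


def block_rate(n, p, l):
    """Rate g_{np} of a jump from n to p blocks in the toy example."""
    return Fraction(stirling2(n, p) * falling(l, p), l ** n)


def total_rate(n, l):
    """Total jump rate g_n = sum of g_{np} over p = 1, ..., n-1."""
    return sum(block_rate(n, p, l) for p in range(1, n))


def generator_on_monomial(n, x, l):
    """A f(x) for f(x) = x^n: E[(B/l)^n] - x^n with B binomial(l, x)."""
    x = Fraction(x)
    mean = sum(
        comb(l, k) * x ** k * (1 - x) ** (l - k) * Fraction(k, l) ** n
        for k in range(l + 1)
    )
    return mean - x ** n


def dual_generator_on_monomial(n, x, l):
    """Block-counting generator applied to n -> x^n."""
    x = Fraction(x)
    return sum(block_rate(n, p, l) * (x ** p - x ** n) for p in range(1, n))
```

Both sides agree because the $n$-th moment of a binomial variable expands in falling factorials with Stirling coefficients; for $n = 1$ both vanish, which reflects the martingale property of $Y$.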



6 Examples 

The first of the two examples in this section presents a model where the population size varies
substantially due to recurrent bottlenecks. It is shown that the $\Xi$-coalescent appears naturally
as the limiting genealogy of this model. In the second example we present the Poisson-Dirichlet
coalescent by choosing a particular measure for $\Xi$ which has a density with respect to the
Poisson-Dirichlet distribution. We provide explicit expressions for several quantities of interest.
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6.1 An example involving recurrent bottlenecks 



Consider a population, say with non-overlapping generations, in which the population size has
undergone occasional abrupt changes in the past. Specifically, we assume that 'typically', each
generation contains $N$ individuals, but at several instances in the past, it has been substantially
smaller for a certain amount of time, and then the population has quickly re-grown to its typical
size $N$. This is related to the models considered by Jagers & Sagitov in [JS04], but we assume
occasional much more radical changes in population size than [JS04]. Let us assume that the
demographic history is described by three sequences of positive real numbers $(s_i)_{i \in \mathbb{N}}$, $(l_{i,N})_{i \in \mathbb{N}}$
and $(b_{i,N})_{i \in \mathbb{N}}$, where $0 < b_{i,N} < 1$ holds for all $i$, and the population size $t$ generations before
the present is given by $G(t)$, where

$$
G(t) = \begin{cases} b_{m,N} N & \text{if } N\big( \sum_{i=1}^{m-1} (s_i + l_{i,N}) + s_m \big) < t \le N \sum_{i=1}^{m} (s_i + l_{i,N}), \\ N & \text{otherwise.} \end{cases}
$$

Thus, back in time the population stays at size $N$ for some time $s_1 N$. Then the size is reduced
to $b_{1,N} N$ for the time $l_{1,N} N$. Thereafter it is again given by $N$, until the next bottleneck occurs
after time $s_2 N$. Note that for simplicity, we have assumed 'instantaneous' re-growth after each
bottleneck. Furthermore, we assume that the reproduction behaviour is given by the standard
Wright-Fisher dynamics, so each individual chooses its parent uniformly at random from the
previous generation, independently of the other individuals. This is the case in every generation,
also during the bottleneck and at the transitions between the bottleneck and the typical size.

We now want to keep track of the genealogy of a sample of $n$ individuals from the present
generation, and describe its dynamics in the limit $N \to \infty$. Denote by $\Pi^{(N,n)}(t)$ the ancestral
partition of the sample $t$ generations before the present.

Lemma 6.1. Fix $(s_i)_{i \in \mathbb{N}}$ and assume that $b_{i,N} \to 0$ and that $l_{i,N} \to 0$ as $N \to \infty$. Furthermore
assume that $b_{i,N} N \to \infty$ and that $l_{i,N} / b_{i,N} \to \gamma_i > 0$. Then

$$
\big( \Pi^{(N,n)}(Nt) \big)_{t \ge 0} \to \big( \Pi^{\delta_0,(n)}(R_t) \big)_{t \ge 0}
$$

weakly as $N \to \infty$ on $D_{\mathcal{P}_n}([0, \infty))$, where $R_t := t + \sum_{i:\, s_1 + \dots + s_i \le t} \gamma_i$.

Note that we assume $l_{i,N} \to 0$ as $N \to \infty$, so the duration of the bottleneck is negligible
on the timescale of the 'normal' genealogy. We also assume $b_{i,N} \to 0$ but $N b_{i,N} \to \infty$, i.e., in
the pre-limiting scenario, the population size during a bottleneck should be tiny compared to
the normal size, but still large in absolute numbers. The ratio $l_{i,N} / b_{i,N}$ is sometimes called the
severity of the ($i$-th) bottleneck in the population genetic literature.

Sketch of proof. Given sequences $(s_i)$, $(b_{i,N})$ and $(l_{i,N})$, classical convergence results for samples
of size $n$ can be applied for the time intervals between bottlenecks and "inside" the bottlenecks.
Since $b_{i,N} N \to \infty$, the probability that any of the ancestral lines of the sample converge exactly
at the transition to a bottleneck is $O((b_{i,N} N)^{-1}) = o(1)$, so that naive "glueing" is feasible. □

Remark 6.2. Note that bottleneck events with $\gamma_i = 0$ become invisible in the limit, whereas in
a bottleneck with $\gamma_i = +\infty$ the genealogy necessarily comes down to only one lineage (and thus,
all genetic variability is erased).

Since we fixed the $s_i$ and $\gamma_i$, the limiting process described in Lemma 6.1 is not a homogeneous
Markov process and thus does not fit literally into the class of exchangeable coalescent
processes considered in this paper. Assume that the waiting intervals $s_i$ are exponentially
distributed, say with parameter $\beta$, and that the $\gamma_i$ are independently drawn from a certain law $\mathcal{L}_\gamma$.



Thus, in the pre-limiting $N$-particle model forwards in time, in each generation there is a chance
of $\approx \beta / N$ that a 'bottleneck event' with a randomly chosen severity begins. In this situation,
the genealogy of an $n$-sample from the population at present is (approximately) described by

$$
\Pi^{\delta_0,(n)}(S_t), \qquad t \ge 0, \tag{6.1}
$$

where $(S_t)_{t \ge 0}$ is a subordinator (in fact, a compound Poisson process with Lévy measure $\beta \mathcal{L}_\gamma$
and drift 1).
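The time change of Lemma 6.1 and the subordinator in (6.1) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function and parameter names are hypothetical, the convention $s_1 + \dots + s_i \le t$ (bottlenecks exactly at time $t$ are counted) is assumed, and the severity law is passed in as a user-supplied sampler `draw_severity` because the text leaves it general.

```python
import random
from bisect import bisect_right
from itertools import accumulate


def time_change(t, waits, severities):
    """R_t = t + sum of gamma_i over all i with s_1 + ... + s_i <= t.

    waits      -- the waiting times s_1, s_2, ... between bottlenecks
    severities -- the limiting severities gamma_1, gamma_2, ...
    """
    bottleneck_times = list(accumulate(waits))
    # Number of bottlenecks that have occurred at or before time t.
    hit = bisect_right(bottleneck_times, t)
    return t + sum(severities[:hit])


def sample_bottlenecks(beta, draw_severity, horizon, rng):
    """Sample exponential(beta) waiting times and i.i.d. severities up to `horizon`.

    Feeding the result into time_change gives one path of a compound Poisson
    process with drift 1; draw_severity(rng) is a hypothetical sampler for the
    severity law.
    """
    waits, severities, elapsed = [], [], 0.0
    while True:
        wait = rng.expovariate(beta)
        elapsed += wait
        if elapsed > horizon:
            return waits, severities
        waits.append(wait)
        severities.append(draw_severity(rng))
```

For instance, `waits, sev = sample_bottlenecks(2.0, lambda r: r.expovariate(1.0), 10.0, random.Random(1))` followed by `time_change(t, waits, sev)` evaluates one simulated path at time `t`; between bottlenecks the path grows with slope 1, and each bottleneck adds a jump equal to its severity.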

Proposition 6.3. Let $N_\sigma$ be the number of lineages at time $\sigma > 0$ in the standard Kingman
coalescent starting with $N_0 = \infty$, and let $D_j$ be the law of the re-ordering of a ($j$-dimensional)
Dirichlet$(1, \dots, 1)$ random vector according to decreasing size, padded with infinitely many zeros.
The process defined in (6.1) is the $\Xi$-coalescent restricted to $\{1, \dots, n\}$, where

$$
\Xi(d\zeta) = \delta_0(d\zeta) + (\zeta, \zeta) \int_{(0,\infty)} \sum_{j=1}^\infty \mathbb{P}(N_\sigma = j)\, D_j(d\zeta)\, \beta \mathcal{L}_\gamma(d\sigma).
$$

Proof. Recall that the number of families of the classical Fleming-Viot process without mutation
after $\sigma$ time units is $N_\sigma$. Given $N_\sigma = j$, the distribution of the family sizes is a uniform partition
of $[0, 1]$, hence Dirichlet$(1, \dots, 1)$. Size-ordering thus leads to the above formula for $\Xi$. □



6.2 The Poisson-Dirichlet case 

The Poisson-Dirichlet distribution $PD_\theta$ with parameter $\theta > 0$ is a distribution concentrated on
the subset $\Delta^*$ of points $\zeta \in \Delta$ satisfying $|\zeta| = 1$. It can, for example, be obtained via size-ordering
of the normalized jumps of a Gamma-subordinator at time $\theta$. For more information on
this distribution we refer to [K75] or [ABT99]. Sagitov [S03] considered the Poisson-Dirichlet
coalescent $\Pi = (\Pi_t)_{t \ge 0}$ with parameter $\theta > 0$, where (by definition) the measure $\Xi$ has density
$\zeta \mapsto (\zeta, \zeta)$ with respect to $PD_\theta$. As the measure $PD_\theta$ is concentrated on $\Delta^*$, the rates (4.8)
reduce to

$$
\lambda(k_1, \dots, k_j) = \int_\Delta \sum_{\substack{i_1, \dots, i_j \in \mathbb{N} \\ \text{all distinct}}} \zeta_{i_1}^{k_1} \cdots \zeta_{i_j}^{k_j}\, PD_\theta(d\zeta).
$$

From the calculations of Kingman [K93] it follows that the Poisson-Dirichlet coalescent has
rates

$$
\lambda(k_1, \dots, k_j) = \frac{\theta^j}{[\theta]_k} \prod_{i=1}^j (k_i - 1)!,
$$

$k_1, \dots, k_j \in \mathbb{N}$ with $k := k_1 + \dots + k_j > j$, where $[\theta]_k := \theta(\theta + 1) \cdots (\theta + k - 1)$.

Möhle and Sagitov [MS01] characterised exchangeable coalescents via a sequence $(F_j)_{j \in \mathbb{N}}$
of symmetric finite measures. For each $j \in \mathbb{N}$, the measure $F_j$ lives on the simplex $\Delta_j :=
\{(\zeta_1, \dots, \zeta_j) \in [0, 1]^j : \zeta_1 + \dots + \zeta_j \le 1\}$ and is uniquely determined via its moments

$$
\lambda(k_1, \dots, k_j) = \int_{\Delta_j} \zeta_1^{k_1 - 2} \cdots \zeta_j^{k_j - 2}\, F_j(d\zeta_1, \dots, d\zeta_j), \qquad k_1, \dots, k_j \ge 2.
$$

For the Poisson-Dirichlet coalescent, an application of Liouville's integration formula shows
that the measure $F_j$ has density $f_j(\zeta_1, \dots, \zeta_j) := \theta^j \zeta_1 \cdots \zeta_j \big(1 - \sum_{i=1}^j \zeta_i\big)^{\theta - 1}$ with respect to the
Lebesgue measure on $\Delta_j$.
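As a consistency check (a sketch added for illustration, not part of the original argument), the moments of this density can be evaluated with the Dirichlet form of Liouville's integral, $\int_{\Delta_j} \prod_i \zeta_i^{a_i - 1} (1 - \sum_i \zeta_i)^{\theta - 1}\, d\zeta = \Gamma(\theta) \prod_i \Gamma(a_i) / \Gamma(\theta + a_1 + \dots + a_j)$, taking $a_i = k_i$ and writing $k := k_1 + \dots + k_j$:

```latex
\begin{align*}
\int_{\Delta_j} \zeta_1^{k_1-2} \cdots \zeta_j^{k_j-2}\, f_j(\zeta_1,\dots,\zeta_j)\, d\zeta_1 \cdots d\zeta_j
  &= \theta^j \int_{\Delta_j} \prod_{i=1}^j \zeta_i^{k_i-1}
     \Big(1 - \sum_{i=1}^j \zeta_i\Big)^{\theta-1} d\zeta_1 \cdots d\zeta_j \\
  &= \theta^j\, \frac{\Gamma(\theta) \prod_{i=1}^j \Gamma(k_i)}{\Gamma(\theta + k)} \\
  &= \frac{\theta^j}{[\theta]_k} \prod_{i=1}^j (k_i - 1)! ,
\end{align*}
```

where the last step uses $\Gamma(\theta + k) / \Gamma(\theta) = [\theta]_k$ and $\Gamma(k_i) = (k_i - 1)!$. The result is the rate formula for the Poisson-Dirichlet coalescent.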

As $\Xi$ is concentrated on $\Delta^*$, it follows that

$$
\int_\Delta \frac{|\zeta|}{(\zeta,\zeta)}\, \Xi(d\zeta) = \int_\Delta \frac{1}{(\zeta,\zeta)}\, \Xi(d\zeta) = \int_\Delta PD_\theta(d\zeta) = 1 < \infty. \tag{6.2}
$$

By [S00, Proposition 29], the Poisson-Dirichlet coalescent is a jump-hold Markov process with
bounded transition rates and step function paths. By [S00, Proposition 30], for arbitrary but
fixed $t > 0$, $\Pi_t$ does not have proper frequencies.

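The block counting process $D := (D_t)_{t \ge 0}$, where $D_t := |\Pi_t|$ denotes the number of blocks of
$\Pi_t$, is a decreasing process with rates

$$
g_{nk} = \frac{n!}{k!} \sum_{\substack{n_1, \dots, n_k \in \mathbb{N} \\ n_1 + \dots + n_k = n}} \frac{\lambda(n_1, \dots, n_k)}{n_1! \cdots n_k!} = \frac{\theta^k}{[\theta]_n} \frac{n!}{k!} \sum_{\substack{n_1, \dots, n_k \in \mathbb{N} \\ n_1 + \dots + n_k = n}} \frac{1}{n_1 \cdots n_k} = \frac{\theta^k}{[\theta]_n}\, s(n, k),
$$

$k, n \in \mathbb{N}$ with $k < n$, where the $s(n, k)$ are the absolute Stirling numbers of the first kind. The
total rates are

$$
g_n := \sum_{k=1}^{n-1} g_{nk} = 1 - \frac{\theta^n}{[\theta]_n}, \qquad n \in \mathbb{N}.
$$

Note that $g_{nk} = \mathbb{P}\{K_n = k\}$, $k < n$, where $K_n$ is a random variable taking values in $\{1, \dots, n\}$
with distribution

$$
\mathbb{P}\{K_n = k\} = \frac{\theta^k}{[\theta]_n}\, s(n, k), \qquad k \in \{1, \dots, n\}.
$$

We have

$$
\gamma_n := \sum_{k=1}^{n-1} (n - k)\, g_{nk} = \sum_{k=1}^{n-1} (n - k)\, \mathbb{P}\{K_n = k\} = n - \mathbb{E} K_n < n.
$$

In particular, $\sum_{n=2}^\infty \gamma_n^{-1} \ge \sum_{n=2}^\infty 1/n = \infty$. Together with (6.2) and $\Xi(\Delta_f) = 0$, where $\Delta_f :=
\{\zeta \in \Delta \mid \zeta_1 + \dots + \zeta_n = 1 \text{ for some } n\}$, it follows from [S00, Proposition 33] that the Poisson-Dirichlet
coalescent stays infinite.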
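The block counting rates of the Poisson-Dirichlet coalescent can be tabulated exactly. The Python sketch below (function names are illustrative, not from the paper) computes the absolute Stirling numbers of the first kind by their recurrence, evaluates $g_{nk} = \theta^k s(n,k) / [\theta]_n$ in rational arithmetic, and forms $\gamma_n = \sum_{k<n} (n-k) g_{nk}$, so that the bound $\gamma_n < n$ used above can be inspected for concrete $\theta$.

```python
from fractions import Fraction
from functools import lru_cache


@lru_cache(maxsize=None)
def stirling1_unsigned(n, k):
    """Absolute Stirling number of the first kind s(n, k), via the recurrence."""
    if n == k:
        return 1
    if k == 0 or k > n:
        return 0
    return (n - 1) * stirling1_unsigned(n - 1, k) + stirling1_unsigned(n - 1, k - 1)


def rising(theta, n):
    """Rising factorial [theta]_n = theta (theta+1) ... (theta+n-1)."""
    out = Fraction(1)
    for i in range(n):
        out *= Fraction(theta) + i
    return out


def block_rate(n, k, theta):
    """Rate g_{nk} of a jump from n to k blocks, for k < n."""
    theta = Fraction(theta)
    return theta ** k * stirling1_unsigned(n, k) / rising(theta, n)


def total_rate(n, theta):
    """Total jump rate g_n = sum of g_{nk} over k = 1, ..., n-1."""
    return sum(block_rate(n, k, theta) for k in range(1, n))


def gamma_n(n, theta):
    """gamma_n = sum over k < n of (n - k) g_{nk}, the mean drop in blocks per unit time."""
    return sum((n - k) * block_rate(n, k, theta) for k in range(1, n))
```

With $\theta = 1$ and $n = 3$, for example, `total_rate(3, 1)` evaluates to `Fraction(5, 6)`, in agreement with $1 - \theta^n / [\theta]_n = 1 - 1/3!$.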

If we assume no mutation, then the generator $L^{\Xi}$ (defined in Remark 4.4) of the corresponding
Fleming-Viot process reduces to

$$
L^{\Xi} G_f(\mu) = \int_\Delta \int_{E^{\mathbb{N}}} \Big[ G_f\Big( \sum_{i=1}^\infty \zeta_i \delta_{x_i} \Big) - G_f(\mu) \Big]\, \mu^{\otimes \mathbb{N}}(dx)\, PD_\theta(d\zeta).
$$